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Title 

A COMPOUND IDENTIFICATION METHOD 

Various patent and non-patent references cited in the present application are 
5 hereby incorporated by reference in their entirety. 

Technical Field of the invention 

The present Invention relates to a method for obtaining the identity of one or 
more display molecules capable of associating with a molecular target. The 
1 0 display molecules initially form part of a library and the present invention 

devises a method for identifying such display molecules that possess certain 
properties relative to a terget. 

Background 

1 5 Methods for obtaining infonmation about a display molecule that possesses a 
binding characteristic towards a molecular target are used in the phage dis- 
play area and are generally known as panning. A typical panning protocol 
includes a library of phages displaying certain specific polypeptides and a 
target. When the library and the target are mixed, polypeptides that have an 

20 ability to bind to the target will form an association complex. Polypeptides 
that do not bind are washed away. 

In theory, it should be possible to rank the members of the library in accor- 
dance with their binding affinity in an elution step. Thus, the best binders 

25 could easily be identified following a single mixing step. However, practical 
experiments show that libraries of many members cannot properly be parti- 
tioned for good and bad binders and a second contacting with the target is 
necessary. Generally, libraries of the size 10«to 10^^ are used to increase 
the chance of finding a successfully binding polypeptide. Therefore, several 

30 rounds are generally required. To obtain a sufficient quantity of phages for 

performing a second contacting with a molecular target, the phages harbour- 
ing the binding polypeptides are amplified using the genetic material (DNA or 
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RNA) of the eluted binding phages. It is usual to perform 3 to 12 rounds of 
contact between the increasingly enriched libraries and the target before a 
minor group of binding peptides can be identified. 

Systems that resemble the phage panning have been evolved by the present 
applicant (WO 02/103008 A2) however with the possibility of displaying other 
molecules than polypeptides. Other systems using the same principles as the 
phage panning include WO 98/31700. WO 93/03172. and WO 00/23458. 

For some systems, e.g. as disclosed In EP 643 778 B1 and WO 93/06121 
A1. it is not immediately possible to perform amplification of the complexes 
after elution of binding display molecules associated with an identifying nu- 
cleic acid. 

The present invention aims at providing a method that only requires a single 
initial contact between a target and a library of display molecules associated 
with an identifying nucleic acid even for large libraries. Thus, reiterated ampli- 
fication of display molecules associated with an identifying or coding nucleic 
acid between each round of contact with the target can be avoided. More- 
over, encoding methods not previous available for large libraries becomes at 
the disposal of the person skilled in the art. 

Summary of the Invention 

The present invention relates to a method for obtaining display molecule(s) 
having affinity towards a target, comprising the steps of: a) providing a library 
comprising a plurality of different display molecules, each display molecule 
being associated with an identifier oligonucleotide, which codes for the iden- 
tity of said display molecule, b) contacting the library with a target to allow for 
an interaction between the display molecules of the library with the target, c) 
partitioning a fraction enriched with identifier oligonucleotides of display 
molecules interacting with the target, d) subjecting the fraction to denaturing 
conditions and subsequently to conditions at which homo-duplexes renatu- 
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rate, e) recovering the homo-duplexes, and f) deducing from the homo- 
duplexes the identity of the display molecule(s) interacting with the target 

An ideal initial library comprises an equal amount of each display molecule 
associated with the identifier oligonucleotide. The display molecule associ- 
ated with the identifier oligonucleotide is also for short tenmed "complex" in 
the following, it is estimated that a standard vial can comprise around 10^^ 
complexes. Therefore a library of 10« complexes Ideally comprises 10« cop- 
ies of each complex. 

The contacting between the library and the molecular target, and the subse- 
quent partitioning of a fraction of the complexes or Just the identifier oligonu- 
cleotides thereof, interacting with the target forms an imbalance in the 
amount of the individual members of the library. If elution Is used for the parti- 
tioning, complexes binding with high affinity to the target will be eluted in a 
relatively higher amounts compared to low affinity binding complexes, which 
will be eluted in a con-espondingly minor amount. 

The obtained imbalance at the nucleic acid level is used in the subsequent 
20 steps. The Identifier oligonucleotide parts of the binding complexes are usu- 
ally, but not necessarily, amplified by PGR or similar to obtain a higher total 
amount of oligonucleotides while retaining the proportion between the indi- 
vidual oligonucleotides. 

25 The PGR amplification provides a homo-duplex product. A mixture of homo- 
and hetero-duplexes is appropriately initiated by subjecting the PGR amplifi- 
cation product to a denaturing process to separate the duplexes into single 
stranded oligonucleotides. Subsequentiy. the mixture of single stranded oli- 
gonucleotides is allowed to renaturate to form the homo-duplexes. The dena- 
turing step is suitably obtained by heating the nucleic acids above the melting 
temperature of the duplexes and the hybridisation step is suitably conducted 
by lowering the temperature below the melting temperature of the homo- 
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duplexes and in some aspects of the Invention also below the melting tem- 
perature of at least a part of the hetero-duplexes. In the event the renaturing 
conditions allow the formation of hetero-duplexes. the presence of various 
different single stranded oligonucleotides some oligonucleotides may find a 
perfectly matching partner and form a homo-duplex, while other oligonucleo- 
tides will hybridise to non-complementary binding partners and form hetero- 
duplexes. In the event the renaturing conditions favours fonmatlon of homo- 
duplexes, while fomiatlon of hetero-duplexes mainly is avoided, e.g. by 
choosing a temperature below the melting temperature of the homo- 
duplexes, but above at least the majority of the hetero-duplexes. the renatur- 
ing step result predominately in the formation of homo-duplexes. The present 
invention takes advantage of the fact that the oligonucleotides most abundant 
easier will find perfect binding partners and fonn homo-duplexes. 

After the denaturing and subsequent renaturing step, the homo-duplexes are 
recovered. Usually, recovery Is conducted by eliminating or reducing the 
amount of hetero-duplexes and single stranded oligonucleotides. Several 
methods are available for reducing the amount of hetero-duplexes and single 
stranded oligonucleotides, including DHPLC as disclosed in US 5.795,976. 
and enzymatic degradation. In some aspects of the invention enzymatic deg- 
radation is preferred due to the availability of enzymes specifically locating 
one or more mis-match pairing nucleobases. 

The recovered pool of homo-duplexes may not fully be depleted for hetero- 
duplexes and single stranded oligonucleotides and/or the diversity of the pool 
may still be too high for a meaningful decoding to reveal a display molecule 
of interest. In an aspect of the invention, the step of subjecting the fraction to 
denaturing conditions and subsequently to conditions at which homo- 
duplexes renaturate and the step of recovering the homo-duplexes is there- 
fore repeated one or more times, i.e. the recovered pool of homo-duplexes Is. 
optionally after nucleic acid amplification, subjected to denaturing conditions 
to fomi single stranded oligonucleotides and subsequent to hybridisation con- 
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ditions to allow for the formation of a new pool of homo-duplexes; and the 
homo-duplexes is recovered by a suitable method as disclosed elsewhere 
herein. Between each repetition, a part of the homo-duplexes may be se- 
quenced to establish whether a further repetition Is necessary to identify a 
display molecule of Interest. 

The identity of the display molecules that possessed the ability of binding to 
the molecular target is finally revealed by some of the decoding the homo- 
duplexes, using conventional methods. When a sequence occurs more fre- 
quent than others this is normally an indication of the fact that a display 
molecule having the desired characteristics has been Identified. 

Detailed Description off the Invention 

The target may be of a biological origin or may be synthetic molecular target. 
Typically, the molecular target stems from an organism selected from human 
and animals, especially vertebras. However, in other embodiments the target 
may originate from a plant. In the quest for a compound with therapeutical 
effect on the human or animal body, the target is usually expected to have an 
importance in a therapeutically theory that combats a certain disease. In the 
quest for discovering compounds with plant protective effect, the target is 
usually expected to originate from, an organism that harms the crop or a 
competing undesired plant. The organism may be a fungus when a com- 
pound with fungicide effect is searched for or an insect when a compound 
having insecticide effect is desired. Optionally, a protein target stemming 
from a biological origin may be derivatised by altering, adding, or deleting 
one or more amino acids. 

The target may be a protein, a small molecular homnone. a lipid, a polysac- 
charide, a whole cell, a nucleic acid, a metabolite, a heme group, etc. In a 
preferred aspect the target is a protein. The protein may serve the function in 
the organism of being an enzyme, a hormone, a structural element, a regula- 
tory protein, a membrane channel or pump, a part of a signal transducing 
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cascade, an antibody, etc. Suitable target enzymes include kinases, phos- 
phatates. and proteases. The protein may occur as an independent entity or 
may be dimers, trimers. tetramers. or polymers and the protein may comprise 
a prosthetic group. Furthermore, the molecular target may be a soluble or 
insoluble agglomerate of one or more proteins and one or more substltuents 
occurring In the body or artificial components. In another prefen-ed embodi- 
ment, the molecular target is a nucleic acid, such as DNA or RNA aptamer or 
ribozyme. 

The target may be Immobilized to a solid support. The solid support can be a 
bead or the surfaces of a well. The target immobilized on the solid support 
may also fonn a stable or quasi-stable dispersion in the media. In a certain 
embodiment, the target is In solution and all the interaction occur In the solu- 
tion too. The absence of an immobilization step generally reduces the back- 
ground noise because there is no background surface to associate to. Thus 
the result of the assay may be more sensitive. In solution, the only back- 
ground noise imaginable is when the oligonucleotides or display molecules of 
the library of complexes binds unspecific to the target molecules. The ab- 
sence of an immobilization step generally necessitates a subsequent recov- 
ery step, e.g. by chromatography. 

In certain aspects of the invention, it is preferred to immobilize the target on a 
solid support. The solid support may be beads of a column or the surface of a 
container. The immobilisation of the molecular target may ease the removal 
of the non-binding complexes by washing or similar means. In a certain em- 
bodiment, a cleavable linkage between the molecular target and the solid 
support is present. The cleavable linker Is preferably selectively cleavable, 
that is. the linkage can be cleaved without cleaving other linkages in the tar- 
get or the complexes. The cleavage of the linkage between the molecular 
target and the solid support may reduce the contribution from the back- 
ground, such as complexes associated with the surface of the solid support 
and not binding to the molecular target. 
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The target may be obtained in any suitable way. A variety of targets are 
commercially available, either as purified protein or as the corresponding 
cDNA. Other protein or peptide targets may be Isolated from tissues or 
mRNA (or the corresponding cDNA) may be extracted from a tissue. Smaller 
peptides may be synthesised chemically using the standard solid-phase 
Fmoc peptide synthesis. When nucleic acids are used or Included in the mo- 
lecular target, it may be synthesised using the standard amedite synthesis 
method or by using the natural machinery. 



It may be an advantage to have all or at least a part of the identifier oligonu- 
cleotides on a double stranded form during the contacting with the molecular 
target, as certain nucleic acids may perform a binding interaction or a cata- 
lytical action on the components present during the contacting step. Thus, in 
15 one embodiment of the invention, the identifier oligonucleotide partly or fully 
Is hybridised to a complementing oligonucleotide. 

The Identifier oligonucleotide comprises the information necessary for decod- 
ing the identity of the display molecule. The identifier oligonucleotide may be 

20 analysed directly in some instances to reveal the identity of the display mole- 
cules that have performed an interaction with the target. The informative part 
can be decoded in a standard sequencing machine. In general however, it is 
preferred to include the infonmative part of the coupled product in to a suit- 
able vector and transfer the vector to a host organism. The host organisms 

25 may then be cultivated on a suitable substrate and allowed to form colonies. 

Samples from the colonies may be used for sequencing in a sequencing ma- 
chine. Also, the identification may be conducted using any sequencing 
method known in the art. including QPCR. microarrays. etc. 

30 nisplav molec "!'- «"H identifier nliaonucleotide association 

The display molecule associated with the identifier oligonucleotide Is some- 
times herein referred to as a bifunctional complex to indicate that a physical 
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connection between the display molecule and the identifier oligonucleotide 
normally is present. However, in certain embodiments of the present inven- 
tion the association between the display molecule and the Identifier oligonu- 
cleotide may be spatial, i.e. an identifier oligonucleotide specifies the spatial 
position of a display molecule. The tema "bifunctional complex" Is Intended 
also to cover the latter embodiment The identifier oligonucleotide comprises 
identifying moieties that Identily the display molecule. Preferably, the Identi- 
fier oligonucleotide identifies the molecule uniquely, i.e. In a library of com- 
plexes a particular identifier oligonucleotide is capable of distinguishing the 
molecule it is associated witii from the rest of the display molecules. 

The display molecule and the identifier oligonucleotide may be attached di- 
rectly to each other or through a bridging moiety. In one aspect of the Inven- 
tioh. the bridging moiety is a selectively cleavable linkage. 

The identifier oligonucleotide may comprise one. two or more codons. The 
codon sequences can be decoded to identify reactants used In the formation 
of the display molecule. When the identifier oligonucleotide comprises more 
than one codon. each member of a pool of chemical entities can be identified 
and the order of codons Is infomiative of the synthesis step each member 
has been Incorporated in. 

The sequence of the nucleotides in each codon may have any suitable 
length. The codon may be a single nucleotide or a plurality of nucleotides. In 
some aspects of the invention, it is preferred that each codon independentiy 
comprises four or more nucleotides, more prefen-ed 4 to 30 nucleotides. 

The identifier oligonucleotide will in general have at least two codons ar- 
ranged In sequence, i.e. next to each other. Two neighbouring codons may 
be separated by a framing sequence. Depending on ttie display molecule 
formed, the identifier oligonucleotide may comprise further codons. such as 
3. 4, 5, or more codons. Each of the further codons may be separated by a 
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suitable framing sequence. Preferably, all or at least a majority of the codons 
of the identifier oligonucleotide are separated from a neighbouring codon by 
a framing sequence. The framing sequence may have any suitable number 
of nucleotides, e.g. 1 to 20. Alternatively, codons on the Identifier oligonu- 
cleotide may be designed with overlapping sequences. 

The framing sequence, if present, may serve various purposes. In one setup 
of the invention, the framing sequence Identifies the position of the codon. 
Usually, the framing sequence either upstream or downstream of a codon 
comprises Information, which allows determination of the position of the 
codons. In another setup, the frames have alternating sequences, allowing 
for addition of building blocks from two pools in the fomrtatlon of the library. 
The framing sequence may also or in addition provide for a region of high 
affinity. The high affinity region may ensure that the hybridisation of the tem- 
plate with an antl-codon will occur In frame. Moreover, the framing sequence 
may adjust the annealing temperature to a desired level. 

A framing sequence with high affinity can be provided by incorporation of one 
or more nucleobases forming three hydrogen bonds to a cognate nucleo- 
base. Examples of nucleobases having this property are guanine and cyto- 
slne. Alternatively, or in addition, the framing sequence may be subjected to 
backbone modification. Several back bone modifications provides for higher 
affinity, such as 2'-0-methyl substitution of the ribose moiety, peptide nucleic 
acids (PNA). and 2'-4" O-methylene cyclisation of the ribose moiety, also re- 
fen-ed to as LNA (Locked Nucleic Acid). 

The Identifier oligonucleotide may comprise one or two flanking regions. The 
flanking region can encompass a signal group, such as a flourophor or a ra- 
dioactive group to allow for detection of the presence or absence of a com- 
plex or the flanking region may comprise a label that may be detected, such 
as biotin. When the identifier oligonucleotide comprises a biotin moiety, the 
Wentifier oligonucleotide may easily be recovered. 
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The flanking region(s) can also serve as priming sites for amplification reac- 
tions, such as PGR. The identifier oligonucleotide may in certain embodi- 
ments comprise an affinity region having the property of being able to hybrid- 
ise to a building block. The priming sites for PGR amplification may be identi- 
cal for all the identifier oligonucleotides to allow for a proportional amplifica- 
tion of the individual identifier oligonucleotides at various stages of the pre- 
sent method. Alternatively, the different priming sites vnth corresponding 
primers may be used to favour the amplification of certain groups of com- 
plexes in the library. 

It is to be understood that when the temi identifier oligonucleotide is used in 
the present description and claims, the identifier oligonucleotide may be in 
the sense or the anti-sense format, i.e. the identifier oligonucleotide can be a 
sequence of codons. which actually codes for the molecule or can be a se- 
quence complementary ttiereto. Moreover, the identifier oligonucleotide may 
be single-stranded or double-stranded, as appropriate. 

During the contacting step the Identifier oligonucleotide usually is in double 
stranded form to minimise any interaction relative to tine target. However, it 
may be suitable to establish the Identifier oligonucleotide in single stranded 
form prior to the formation of a mixture of the hetero- and homo-duplexes. 
Starting with a double stranded identifier oligonucleotide, a single stranded 
identifier oligonucleotide may easily be prepared by extension of a fonA^ard 
primer annealed at a priming site of the identifier oligonucleotide. 

The display molecule part of the complex is generally of a chemical structure 
expected of having an effect on the target When the target is of pharmaceu- 
tical importance, the molecule is generally a possible drug candidate. The 
complex may be formed by tagging a library of different possible drug candi- 
dates with a tag. e.g. a nucleic acid tag identifying each possible drug candi- 
date. In another embodiment of the invention, the molecule is encoded, i.e. 
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formed by a variety of reactants. which have reacted with each other and/or a 
scaffold molecule. Optionally, this reaction product may be post-modified to 
obtain the final molecule displayed on the complex. The post-modification 
may involve the cleavage of one or more chemical bonds attaching the en- 
5 coded molecule to the identifier In order more efficiently to display the en- 
coded molecule. In still another embodiment the display molecule is a poly- 
peptide formed using the natural machinery, such as the methods disclosed 
in WO 92/02536, WO 91/05058. and US 6.194,550. 

1 0 A variety or methods for association of an oligonucleotide to a polypeptide 
display molecule is available for the skilled person in the art. An option In- 
volves the association of a display molecule protein with the mRNA respon- 
sible for the formation thereof. This method is generally referred to as mRNA 
display. Optionally, the mRNA may be substituted with the corresponding 
15 cDNA. A method for generation such a single or a library effusions between 
a protein and the mRNA responsible for the fomnatlon thereof is disclosed in 
WO 98/31700. The corresponding DNA strand may be attached to the pro- 
tein using the method disclosed in WO 00/32823. The contents of both patent 
applications being incorporated in their entirety by reference herein. The 
20 method of WO 98/31700 includes providing a RNA stand comprising a trans- 
lation initiation sequence, a start codon operable linked to a protein encoding 
sequence, and a peptide acceptor at the 3' end and translating the protein 
encoding sequence to produce a RNA-protein fusion. According to WO 
00/32823 a DNA primer is covalently connected to the 3' end of the mRNA 
25 strand and extended by reverse transcriptase a to prepare the complement- 
ing DNA strand. The original RNA strand may be digested by RNase H. An- 
other suitable method for generating a library is disclosed in WO 01/90414. 
the content of which is Incorporated herein by reference. 

30 In accordance with another option, the identifier oligonucleotide is associated 
with a polypeptide display molecule using a method generally referred to as 
ribosome display. Rlbosome display is disclosed in WO 93/03172. the con- 
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tent of which is included herein by reference. A still further option for associa- 
tion is phage display, in which a polypeptide display molecule is presented on 
the capsule of the phage and the same capsule harbours the RNA or DNA 
responsible for the formation of the polypeptide. 

5 

A further option for associating the display molecule with an identifier oli- 
gonucleotide includes the method disclosed in M. Yonezawa etal. Nucleic 
acid research, 2003, vol. 31, No. 19 e118 (included by reference). The 
method includes the initial provision of an oligonucleotide connected to biotin 

1 0 and compartmentalization thereof together with a transcription and translation 
system. The oligonucleotide comprises a fusion gene coding for streptavldin 
and a polypeptide display molecule. After the formation of the fusion protein 
in each compartment, the streptavidin part of the fusion protein binds to the 
biotin moiety of the oligonucleotide, thereby associating the display molecule 

15 with the oligonucleotide coding for the identity thereof. 

In case the display molecule is a nucleic acid, it may be of the aptamer type, 
i.e. a library of aptamers comprising constant nucleic acid regions flanking a 
random oligonucleotide part. The random oligonucleotide part serves the 
20 dual function of a nucleic acid display molecule and the identifying oligonu- 
cleotide. 

The formation of a synthetic encoded molecule generally starts by a scaffold, 
i.e. a chemical unit having one or more reactive groups capable of forming a 

25 connection to another reactive group positioned on a chemical entity, thereby 
generating an addition to the original scaffold. A second chemical entity may 
react with a reactive group also appearing on the original scaffold or a 
reactive group incorporated by the first chemical entity. Further chemical 
entities may be involved in the formation of the final reaction product. The 

30 formation of a connection between the chemical entity and the nascent 

encoded molecule may be mediated by a bridging molecule. As an example, 
if the nascent encoded molecule and the chemical entity both comprise an 
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amine group a connection between these can be mediated by a dicarboxylic 
acid. A display molecule is in general produced in vitro and may be a 
naturally occurring or an artificial substance. In an aspect of the invention, a 
display molecule is not produced using the natural translation system In an in 
vitro process. In other aspects of the invention, the display molecule is a 
polypeptide produced using the natural translation machinery. 

The chemical entities that are precursors for structural additions or 
eliminations of the encoded molecule may be attached to a building block 
prior to the participation In the formation of the reaction product leading to the 
final display molecule. Besides the chemical entity, the building block 
generally comprises an anti-codon. In some embodiments the building blocks 
also comprise an affinity region providing for affinity towards the nascent 
complex. 

In a certain aspect of the invention, the reactants or chemical entities are 
suitably mediated to the nascent encoded molecule by a building block, 
which further comprises an antlcodon. The anti-codon serves the function of 
transfen-lng the genetic information of the building block in conjunction with 
the transfer of a chemical entity. The transfer of genetic information and 
chemical entity may occur in any order, however, it is important that a 
correspondence is maintained in the complex. The chemical entities are 
preferably reacted without enzymatic Interaction in some aspects of the 
invention. Notably, ribosomes or enzymes having similar activity do 
preferably not mediate the reaction of the chemical entities. In another aspect 
of the invention a ribosome is used to translate an mRNA into a protein using 
a tRNA loaded with a natural or unnatural amino acid. In still another aspect 
of the invention, enzymes having catalytic activities different from that of 
ribosomes are used in the fomnation of the display molecule. 

According to certain aspects of the invention the genetic information of the 
anti-codon is transferred by specific hybridisation to a codon on a nucleic 
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acid template. Another method for transferring the genetic information of the 
anti-codon to the nascent complex is to anneal an oligonucleotide 
complementary to the anti-codon and attach this oligonucleotide to the 
complex, e.g. by ligation. A still further method involves transferring the 
5 genetic information of the anti-codon to the nascent complex by an extension 
reaction using a polymerase and a mixture of dNTPs. 

The chemical entity of the building block may in certain cases be regarded as 
a precursor for the structural entity eventually incorporated into the encoded 

10 molecule. In other cases the chemical entity provides for the eliminations of 
chemical units of the nascent encoded molecule. Therefore, when it in the 
present application with claims is stated that a chemical entity is reacted with 
a nascent encoded molecule it is to be understood that not necessarily all the 
atoms of the original chemical entity is to be found in the eventually formed 

15 encoded molecule. Also, as a consequence of the reactions involved In the 
connection, the structure of the chemical entity can be changed when it ap- 
pears on the nascent encoded molecule. Especially, the cleavage resulting in 
the release of the entity may generate a reactive group, which in a subse- 
quent step can participate In the formation of a connection between a nas- 

20 cent complex and a chemical entity. 

The chemical entity of the building block comprises at least one reactive 
group capable of participating in a reaction, which results in a connection be- 
tween the chemical entity of the building block and another chemical entity or 

25 a scaffold associated with the nascent complex. The number of reactive 

groups, which appears on the chemical entity, is suitably one to ten. A build- 
ing block featuring only one reactive group is used La. in the end positions of 
polymers or scaffolds, whereas building blocks having two reactive groups 
are suitable for the formation of the body part of a polymer or scaffolds capa- 

30 ble of being reacted further. One, two or more reactive groups intended for 
the formation of connections are typically present on scaffolds. Non-limiting 



iiqdto from the IFW Imaqe Database on 03/02/2005 



15 

examples of scaffolds are opiates, steroids, benzodiazepines, hydantoines, 
and peptidylphosphonates. 



The reactive group of the chemical entity may be capable of forming a direct 
5 connection to a reactive group of the nascent complex or the reactive group 
of the building block may be capable of forming a connection to a reactive 
group of the nascent complex through a bridging fill-in group. It is to be un- 
derstood that not all the atoms of a reactive group are necessarily maintained 
in the connection formed. Rather, the reactive groups are to be regarded as 
1 0 precursors for the structure of the connection. 

The subsequent cleavage step to release the chemical entity from the build- 
ing block can be performed in any appropriate way. In an aspect of the inven- 
tion the cleavage involves usage of a chemical reagent or an enzyme. The 

15 cleavage results in a transfer of the chemical entity to the nascent encoded 
molecule or in a transfer of the nascent encoded molecule to the chemical 
entity of the building block. In some cases it may be advantageous to intro- 
duce new chemical groups as a consequence of linker cleavage. The new 
chemical groups may be used for further reaction in a subsequent cycle, ei- 

20 ther directly or after having been activated. In other cases it is desirable that 
no trace of the linker remains after the cleavage. 

In another aspect, the formation of connection between chemical entity and 
nascent encoded molecule and the cleavage between chemical entity and 

25 the remainder of the building block is conducted as a simultaneous reaction, 
i.e. either the chemical entity of the building bjock or the nascent encoded 
molecule is a leaving group of the reaction. In some aspects of the invention, 
it is appropriate to design the system such that the connection and the cleav- 
age occur simultaneously because this will reduce the number of steps and 

30 the complexity. The simultaneous connection and cleavage can also be de- 
signed such that either no trace of the linker remains or such that a new 
chemical group for further reaction Is Introduced, as described above. 
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The attachment of the chemical entity to the building block, optionally via a 
suitable spacer can be at any entity available for attachment, e.g. the chemi- 
cal entity can be attached to a nucleobase or the backbone. In general. It is 
preferred to attach the chemical entity at the phosphor of the Intemucleoslde 
linkage or at the nucleobase. When the nucleobase is used for attachment of 
the chemical entity, the attachment point Is usually at the 7 position of the 
purines or 7-deaza-purins or at the 5 position of pyrimldines. The nucleotide 
may be distanced from the reactive group of the chemical entity by a spacer 
moiety. The spacer may be designed such that the confomnational spaced 
sampled by the reactive group is optimized for a reaction with the reactive 
group of the nascent encoded molecule. 

The display molecules of the invention may have any chemical structure. In a 
prefen-ed aspect, the display molecule can be any compound that may be 
synthesized in a component-by-component fashion. In some aspects the 
display molecule is a linear or branched polymer. In another aspect the 
display molecule is a scaffolded molecule. The term "display molecule" also 
comprises naturally occuning molecules like a-polypeptldes etc, however 
produced In vitro usually in the absence of enzymes, like ribosomes. In 
certain aspects, the display molecule of the library is a non-a-polypeptide. 

The display molecule may have any molecular weight. However, in order to 
be orally available, it Is in this case preferred that the display molecule has a 
molecular weight less than 2000 Daltons, preferably less than 1000 Dalton. 
and more preferred less than 500 Daltons. 

The size of the library may vary considerably pending on the expected result 
of the Inventive method. In some aspects, it may be sufficient that the library 
comprises two, three, or four different complexes. However, in most events, 
more than two different complexes are desired to obtain a higher diversity. In 
some aspects, the library comprises 1,000 or more different complexes; more 
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preferred 1 .000,000 or more different complexes. The upper Hmit for the size 
of the library is only restricted by the size of the vessel in which the library is 
comprised. It may be calculated that a vial may comprise up to lO^'' different 
complexes. 

^/lf»t|iQds for formino libraries of nnmolexes 

The complexes comprising an Identifier oligonucleotide having two or more 
codons that codes for reactants that have reacted in the formation of the 
molecule part of the complex may be fonmed by a variety of processes. Gen- 
erally, the preferred methods can be used for the formation of virtually any 
kind of encode molecule. Suitable examples of processes include prior art 
methods disclosed in WO 93/20242. WO 93/061 21 , WO 00/23458. WO 
02/074929. and WO 02/103008. the content of which being incorporated 
herein by reference as well as methods of the present applicant not yet public 
available. Including the methods disclosed in PCT/DK03/00739 filed 30 Oc- 
tober 2003. and DK PA 2003 00430 filed 20 March 2003. Any of these meth- 
ods may be used, and the entire content of the patent applications are in- 
cluded herein by reference. 

Below five presently preferred embodiments are described. A first 
embodiment disclosed in more detail In WO 02/103008 is based on the use 
of a polymerase to incorporate unnatural nucleotides as building bloclcs. 
Initially, a plurality of template oligonucleotides is provided. Subsequently 
primer^ are annealed to each of the templates and a polymerase is extending 
the primer using nucleotide derivatives, which have appended chemical 
entities. Subsequent to or simultaneously with the incorporation of the 
nucleotide derivatives, the chemical entitles are reacted to fbmn a reaction 
product. The encoded molecule may be post-modified by cleaving some of 
the linking moieties to better present the encoded molecule. 



Several possible reaction approaches for the chemical entities are apparent. 
First, the nucleotide derivatives can be Incorporated and the chemical entitle 
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subsequently polymerised. In the event the chemical entities each carry two 
reactive groups, the chemical entities can be attached to adjacent chemical 
entities by a reaction of these reactive groups. Exemplary of the reactive 
groups are amine and carboxyllc acid, which upon reaction form an amide 
bond. Adjacent chemical entities can also be linked together using a linking 
or bridging moiety. Exemplary of this approach is the linking of two chemical 
entitles each bearing an amine group by a bl-carboxyllc acid. Yet another 
approach is the use of a reactive group between a chemical entity and the 
nucleotide building block, such as an ester or a holster group. An adjacent 
building block having a reactive group such as an amine may cleave the 
Interspaced reactive group to obtain a linkage to the chemical entity, e.g. by 
an amide linking group. 

A second embodiment for obtalnmer^t of complexes disclosed In WO 
02/103008 pertains to the use of hybridisation of building blocks to a template 
and reaction of chemical entitles attached to the building blocks in order to 
obtain a reaction product. This approach comprises that templates are 
contacted with a plurality of building blocks, wherein each building block 
comprises an anti-codon and a chemical entity. The antl-codons are 
designed such that they recognise a sequence. I.e. a codon. on the template. 
Subsequent to the annealing of the antl-codon and the codon to each other a 
reaction of the chemical entity Is effected. 

The template may be associated with a scaffold. Building blocks bringing 
chemical entitles in may be added sequentially or simultaneously and a 
reaction of the reactive group of the chemical entity may be effected at any 
time after the annealing of the building blocks to the template. 

A third embodiment for the generation of a complex includes chemical or 
enzymatic ligation of building blocks when these are lined up on a template. 
Initially, templates are provided, each having one or more codons. The 
templates are contacted with building blocks comprising anti-codons linked to 
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chemical entities. The two or more anti-codons annealed on a template are 
subsequently ligated to each other and a reaction of the chemical entitles Is 
effected to obtain a reaction product. The method Is disclosed In more detail 
in DK PA 2003 00430 filed 20 March 2003. 

A fourth embodiment makes use of the extension by a polymerase of an 
affinity sequence of the nascent complex to transfer the antl-codon of a 
building block to the nascent complex. The method implies that a nascent 
complex comprising a scaffold and an affinity region Is annealed to a building 
block comprising a region complementary to the affinity section. 
Subsequently, the antl-codon region of the building block Is transferred to the 
nascent complex by a polymerase. The transfer of the chemical entity may 
be transferred prior to. simultaneously with or subsequent to the transfer of 
the antl-codon. This method is disclosed In detail in PCT/DK03/00739. 

A fifths embodiment also disclosed In PCT/DK03/00739 comprises reaction 
of a reactant with a site reaction site on nascent bifunctional molecule and 
addition of a nucleic acid tag to the nascent bifunctional molecule using an 
enzyme, such as a ligase. \Nhen a library is formed, usually an array of 
compartments Is used for reaction of reactants and enzymatic addition of 
tags with the nascent bifunctional molecule. 

Thus, the codons are either pre-made into one or more templates before the 
encoded molecules are generated or the codons are transferred 
simultaneously with the fomiatlon of the encoded molecules. 

After or simultaneously with the formation of the reaction product some of the 
linkers to the template may be cleaved, however, usually at least one linker is 
maintained to provide for the complex. 
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Nucleotides 

The nucleotides used in the present invention may be linked together in a 
sequence of nucleotides, i.e. an oligonucleotide. Each nucleotide monomer is 
normally composed of two parts, namely a nucleobase moiety, and a back- 
bone. The backbone may in some cases be subdivided Into a sugar moiety 
and an internucleoside linker. 

The nucleobase moiety may be selected among naturally occumng nucleo- 
bases as well as non-naturally occurring nucleobases. Thus, "nucleobase" 
includes not only the known purine and pyrimidine hetero-cycles. but also 
heterocyclic analogues and tautomers thereof. Illustrative examples of nu- 
cleobases are adenine, guanine, thymine, cytosine. uracil, purine, xanthine, 
diaminopurine, 8-oxo-N«-methyladenlne. 7-deazaxanthine. 7-deazaguanine. 
N^N^-ethanocytosln. N« N«-ethano-2.6-diamlno-purine, 5-methylcytosine. 5- 
(C^-C^)-alkynylcytosine. 5-fluorouracil, 5-bromouracil. pseudoisocytosine, 2- 
hydroxy-5-methyl-4-trlazolopyrldine. Isocytosine. isoguanine. inosine and the 
"non-naturally occurring" nucleobases described in Benner et al.. U.S. Pat 
No. 5.432.272. The tenn "nucleobase" is intended to cover these examples 
as well as analogues and tautomers thereof. Especially interesting nucleo- 
bases are adenine, guanine, thymine, cytosine. 5-methylcytosine. and uracil, 
which are considered as the naturally occumng nucleobases in relation to 
therapeutic and diagnostic application in humans. 



Examples of suitable specific pairs of nucleobases are shown 
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Natural Base Pairs 



NH2 HN 
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R=CH3: Thymine Cytosine 
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Synthetle purine bases pairring with natural pyrimldines 
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7-deaza guanine 



Suitable examples of backbone units are shown below (B denotes a nucleo- 
base): 
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The sugar moiety of the backbone is suitably a pentose but may be the ap- 
5 propriate part of a PNA or a six-member ring. Suitable examples of possible 
pentoses include ribose. Z-deoxyrlbose. 2'-0-methyl-ribose. 2'-flour-ribose. 
and 2'-4'-0-methylene-ribose (LNA). Suitably the nucleobase is attached to 
the 1' position of the pentose entity. 

10 An Intemucleoside linker connects the 3' end of preceding monomer to a 5' 
end of a succeeding monomer when the sugar moiety of the backbone is a 
pentose, like ribose or 2-deoxyribose. The intemucleoside linkage may be 
the natural occurring phospodiester linkage or a derivative thereof. Exam- 
ples of such derivatives include phosphorothioate. methylphosphonate, 

15 phosphoramidate. phosphotriester. and phosphodithioate. Furthermore, the 
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Internucleoside linker can be any of a number of non-phosphorous-containing 
linkers known in the art. 

Preferred nucleic acid monomers include naturally occurring nucleosides 
forming part of the DNA as well as the RNA family connected through phos- 
phodiester linkages. The members of the DNA family Include de- 
oxyadenosine. deoxyguanosine. deoxythymldine. and deoxycytidine. The 
members of the RNA family include adenosine, guanosine. uridine, cytldlne. 
and inosine. Inosine is a non-specific pairing nucleoside and may be used as 
universal base because inosine can pair neariy isoenergetically with A. T. 
and C. Other compounds having the same ability of non-specifically base- 
pairing with natural nucleobases have been fonmed. Suitable compounds 
which may be utilized in the present invention Includes among others the 
compounds depicted below 
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Examples of TJnivers al Bases: 
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Building block 

The chemical entities or reactants that are precursors for structural additions 
or eliminations of the encoded molecule may be attached to a building block 
5 prior to the participation in the fonmatlon of the reaction product leading to the 
final encoded molecule. Besides the chemical entity, the building block gen- 
erally comprises an anti-codon. 

The chemical entity of the building block comprises at least one reactive 
10 group capable of participating in a reaction, which results in a connection be- 
tween the chemical entity of the building block and another chemical entity or 
a scaffold associated with the nascent complex. The connection is facilitated 
by one or more reactive groups of the chemical entity. The number of reac- 
tive groups, which appear on the chemical entity, is sultatjiy one to ten. A 
1 5 building block featuring only one reactive group is used l.a. in the end posi- 
tions of polymers or scaffolds, whereas building blocks having two reactive 
groups are suitable for the fomnation of the body part of a polymer or scaf- 
folds capable of being reacted further. One. two or more reactive groups in- 
tended for the formation of connections are typically present on scaffolds. 

20 

The reactive group of the building block may be capable of forming a direct 
connection to a reactive group of the nascent complex or the reactive group 
of the building block may be capable of forming a connection to a reactive 
group of the nascent complex through a bridging fill-in group. It is to be un- 
25 derstood that not all the atoms of a reactive group are necessarily maintained 
in the connection formed. Rather, the reactive groups are to be regarded as 
precursors for the structure of the connection. 

The subsequent cleavage step to release the chemical entity from the bulld- 
30 Ing block can be performed in any appropriate way. In an aspect of the inven- 
tion the cleavage involves usage of a reagent or an enzyme. The cleavage 
results In a transfer of the chemical entity to the nascent encoded molecule 
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or In a transfer of the nascent encoded molecule to the chemical entity of the 
building block. In some cases it may be advantageous to introduce new 
chemical groups as a consequence of linker cleavage. The new chemical 
groups may be used for further reaction in a subsequent cycle, either directiy 
or after having been activated. In other cases it is desirable that no trace of 
the linker remains after the cleavage. 

In another aspect, the connection and the cleavage are conducted as a si- 
multaneous reaction. I.e. either the chemical entity of the building block or the 
nascent encoded molecule is a leaving group of the reaction. In general, it is 
preferred to design the system such that the connection and the cleavage 
occur simultaneously because this will reduce the number of steps and the 
complexity. The simultaneous connection and cleavage can also be designed 
such that either no trace of ttie linker remains or such that a new chemical 
group for further reaction is introduced, as described above. 

The attachment of the chemical entity to the building block, optionally via a 
suitable spacer can be at any entity available for attachment, e.g. the chemi- 
cal entity can be attached to a nucleobase or the backbone. In general, It is 
preferred to attach the chemical entity at ttie phosphor of the intemucleoside 
linkage or at the nucleobase. When the nucleobase is used for attachment of 
the chemical entity, the attachment point is usually at the 7 position of the 
purines or 7-deaza-purins or at ttie 5 position of pyrimidines. The nucleotide 
may be distanced from ttie reactive group of the chemical entity by a spacer 
moiety. The spacer may be designed such that the conformational space 
sampled by the reactive group is optimized for a reaction with the reactive 
group of ttie nascent encoded molecule or reactive site. 

The anticodon complements the codon of the identifier oligonucleotide se- 
quence and generally comprises the same number of nucleotides as ttie 
codon. The anticodon may be adjoined with a fixed sequence, such as a se- 
quence complementing a framing sequence. 
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Various specific building blocks are envisaged. Building blocks of particular 
interest are shown below. 



Building blocks transferring a chemical entity to a recipient nucleophilic group 
The building block indicated below Is capable of transfenring a chemical entity 
(CE) to a recipient nucleophilic group, typically an amine group. The bold 
lower horizontal line illustrates the building block comprising an anti-codon 
and the vertical line illustrates a spacer. The 5-membered substituted N- 
hydroxysuccinimid (NHS) ring serves as an activator, i.e. a labile bond is 
formed between the oxygen atom connected to the NHS ring and the chemi- 
cal entity. The labile bond may be cleaved by a nucleophilic group, e.g. posi- 
tioned on a scaffold 




CE 



The 5-membered substituted N-hydroxysuccinimid (NHS) ring sen/es as an 
activator, i.e. a labile bond is fomied between the oxygen atom connected to 
the NHS ring and the chemical entity. The labile bond may be cleaved by a 
nucleophilic group, e.g. positioned on a scaffold, to transfer the chemical en- 
tity to the scaffold, thus converting the remainder of the fragment Into a leav- 
ing group of the reaction. When the chemical entity is connected to the acti- 
vator through a carbonyl group and the recipient group is an amine, the bond 
fomied on the scaffold will an amide bond. The above building block is the 
subject of WO03078627A2, the content of which is incorporated herein in 
their entirety by reference. 



• » » 
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Another building block, which may form an amide bond, is 




R may be absent or NO2. CF3. halogen, preferably CI. Br, or I. and Z may be 
S or O. This type of building block is disclosed in WO03078626A2. The con- 
tent of this patent application is incorporated herein in the entirety by refer- 



ence. 



10 A nucleophilic group can cleave the linkage between Z and the carbonyl 
group thereby transferring the chemical entity -(C=0)-CE' to said nucleo- 
philic group. 

Building blocks transferring a chemical entity to a recipient reactive group 
15 forming a C=C bond 

A building block as shown below Is able to transfer the chemical entity to a 
recipient aldehylde group thereby forming a double bond between the carbon 
of the aldehyde and the chemical entity 

20 
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The above building block is disclosed In WO03078445A2. the content of 
which being incorporated herein in the entirety by reference. 



10 



Building blocks transferring a chemical entity to a recipient reactive group 
forming a C-C bond 

The below building block is able to transfer the chemical entity to a recipient 
group thereby forming a single bond between the receiving moiety, e.g. a 
scaffold, and the chemical entity. 



15 



The above building block Is disclosed in WO03078445A2. the content of 
which being incorporated herein in the entirety by reference. 

Another building block capable of transferring a chemical entity to a receiving 
reactive group forming a single bond is 
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O CE 



The receiving group may be a nucleophile, such as a group comprising a 
hetero atom, thereby forming a single bond between the chemical entity and 
5 the hetero atom, or the receiving group may be an electronegative carbon 

atom, thereby forming a C-C bond between the chemical entity and the scaf- 
fold- The above building blocl^ is disclosed in WO03078446A2, the content of 
which IS incorporated herein by reference. 

10 The chemical entity attached to any of the above building bloclcs may be a 
selected from a large arsenal of chemical structures. Examples of chemical 
entities are 

H or entities selected among the group consisting of a Ci-Ce alkyi, C2-C6 al- 

kenyl, Cz-Cq alkynyl, C4-C8 ajkadienyl, C3-C7 cycloalkyi, C3-C7 cycloheteroal- 
15 kyl, aryl, &nd heteroaryl, said group being substituted with 0-3 0-3 and 

0-3 R^or C1-C3 alkylene-NR'^a, C1-C3 alkylene-NR'*C(0)R®, C1-C3 al- 

kylene-NR'^CCOpR^. C1-C2 alkylene-O-NR'^2, C1-C2 alkyIene-0-NR*C(0)R®, 

C1-C2 alkylene-0-NR'^C(0)OR^ substituted with 0-3 R^ 

where R"^ is H or selected independently among the group con- 
20 sisting of Ci-Ce alkyI, Ca-Ce alkenyl, Cg-Ce alkynyl, C3-C7 cycloalkyi, C3-C7 

cycloheteroalkyi, aryl, heteroaryl, said group being substituted with 0-3 R^ 

and 

R^ is selected independently from -N3, -CNO, -C(NOH)NH2. 
-NHOH, -NHNHR^. -C(0)R®, -SnR^s, -B(OR^)2, -P{0)(0R^)2 or the group 
25 consisting of Ca-Ce alkenyl, C2-C6 alkynyl, C4-C8 alkadienyl said group being 
substituted with 0-2 R^, 
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where is selected independently from H. Ci-Ce alkyl. C3.C7 
cycloalkyl. aryl or Ci-Ce alkylene-aryl substituted with 0-5 halogen atonns se- 
lected from -F. -CI, -Br, and -I; and ^ 

r7 is Independently selected from -NO2. -COOR®. -COR . -CN. 

-OSIR®3. -OR^ and -NR^a- 

r8 is H. Ci-Ce alkyl. Cz-Ce alkenyl. Cz-Ce alkynyl. C3-C7 cycloal- 
kyl. aryl or Ci-Ce alkylene-aryl substituted with 0-3 substltuents Independ- 
ently selected from -F. -CI. -NO2. -R'. -OR^ -SiR^ 

r9 ,3 =0. -F, -CI. -Br. -I. -CN, -NO2. -OR«. -NR^. -NR«-C(0)R«. 
-NR«-C(0)OR«. -SR^ -S(0)R^ -S(0)2R^ -COOR«. -C(0)NR^ and 
-S(0)2NR^2- 



Cross-link cleavage building blocks 

It may be advantageous to split the transfer of a chemical entity to a recipient 
reactive group Into two separate steps, namely a cross-linking step and a 
cleavage step because each step can be optimized. A suitable building block 
for this two-step process Is illustrated below: 




initially, a reactive group appearing on the functional entity precursor (abbre- 
viated FEP) reacts with a recipient reactive group, e.g. a reactive group ap- 
pearing on a scaffold, thereby fomiing a cross-link. Subsequently, a cleavage 
is performed, usually by adding an aqueous oxidising agent such as I2. Brz. 
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Cl2. H*. or a Lewis acid. The cleavage results In a transfer of the group HZ- 
FEP- to the recipient moiety, such as a scaffold. 

In the above formula 

Z is O. S, NR* 
Q is N, CR^ 

P is a valence bond, O, S, NR"*. or a group Cs-rarylene, Ci. 
ealkylene, d-eO-alkylene, Ci^S-aIkylene. NR^-alkylene. Ci-ealkylene-O, Ci. 
ealkylene-S option said group being substituted with 0-3 R^ 0-3 R« and 0-3 
rs or C1-C3 alkylene-NR*2. C1-C3 alkylene-NR*C(0)R«, C1-C3 al- 
kylene-NR'*C(0)OR«, C1-C2 alkylene-O-NR'^a. C1-C2 alkylene-0-NR''C(0)R^ 
C1-C2 alkylene-0-NR^C(0)OR« substituted with 0-3 R®, 

B is a group comprising D-E-F, in which 

D Is a valence bond or a group Ci-ealkylene. Ci^alkenylene. Ci. 
ealkynylene. Cs-Tarylene. or C^yheteroarylene. said group optionally being 

substituted with 1 to 4 group R^\ 

E is, when present, a valence bond. O, S. NR'*. or a group Ci. 
ealkylene. Ci.6alkeny!ene. Ci.6alkynylene. Cs-zarylene. or Cs-yheteroarylene. 
said group optionally being substituted with 1 to 4 group 

F is, when present. .a valence bond, O, S, or NR , 
A is a spacing group distancing the chemical stnicture from the 
complementing element, which may be a nucleic acid, 

r\ R^. and R^ are independent of each other selected among 
the group consisting of H. Ci-Ce alkyl. Cz-Ce alkenyl. Cz-Ce alkynyl, C4-C8 
alkadlenyl. C3-C7 cycloalkyi, C3-C7 cycloheteroalkyl. aryl. and heteroaryl. said 
group being substituted with 0-3 R^ 0-3 R^ and 0-3 R^or C1-C3 al- 
kylene-NR^2. C1-C3 alkylene-NR^C(0)R^ C1-C3 alkylene-NR^C(0)OR°, C1-C2 
alkylene-0-NR^2. C1-C2 aIkylene-0-NR^C(0)R^ C1-C2 al- 
kylene-0-NR'*C(0)OR° substituted with 0-3 R^, 

FEP is a group selected among the group consisting of H, Ci-Ce 
alkyl. C2-C6 alkenyl, C2-C6 alkynyl. C4-C8 alkadlenyl. C3-C7 cycloalkyi. C3-C7 
cycloheteroalkyl. aryl. and heteroaryl. said group being substituted with 0-3 
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R'*. 0-3 and 0-3 R^or C1-C3 alkylene-NR^a. C1-C3 alkylene-NR*C(0)R«, 
C1-C3 alkylene-NR'^CCOpR^. C1-C2 alkylene-O-NR^'a. C1-C2 al- 
kylene-0-NR^C(0)R«. C1-C2 alkylene-O-NR'^CCOpR^ substituted with 0-3 

R^. 

where R* is H or selected independently among the group con- 
sisting of Ci-Ce alkyi. Ca-Ce alkenyl. Cz-Ce alkynyl. C3-C7 cycloalkyl. C3-C7 
cycloheteroaikyi. aryl. heteroaryl. said group being substituted with 0-3 R« 
and 

r5 is selected independently from -N3. -CNO. -C(NOH)NH2. 
-NHOH. -NHNHR^ -C(0)R^ -SnR^s. -B(OR«)2. -P(0)(0R«)2 or the group 
consisting of Cz-Ce alkenyl. Ca-Ce alkynyl. C4-C8 alkadienyl said group being 

substituted with 0-2 R^, 

where R^ is selected independently from H. Ci-Ce alkyl. C3-C7 
cycloalkyl. aryl or Ci-Ce alkylene-aryl substituted with 0-5 halogen atoms se- 
lected from -F, -CI. -Br. and -I: and R^ Is independently selected from -NO2. 
-COOR^, -COR^. -CN, -OSiRS. -OR^ and -NR^2. 

R« is H, C1-C6 alkyl. Ca-Cs alkenyl. C2-C6 alkynyl. C3-C7 cycloalkyl. aryl or 
Ci-Ce alkylene-aryl substituted with 0-3 substituents independently selected 
from -F. -CI. -NO2. -R^ -OR^. -SiR^ 

is =0. -F. -CI. -Br. -I. -CN. -NO2. -OR«. -NR«2. -NR«-C(0)R«, 
-NR«-C(0)OR«. -SR«. -S(0)R^ -S(0)2R^ -COOR^ -C(0)NR«2 and 
-S(0)2NR^2. 

In a preferred embodiment Z is O or S. P is a valence bond. Q is CH. B is 
CH2. and R\ R^. and R^ is H. The bond between the carbonyl group and Z is 
cleavable with aqueous I2. 

nnntaf^tinn hetweep tarqi=.t and library 

The contacting step, by which the library of bifunctlonal molecules is sub- 
jebted under binding conditions to a target, may be referred to as the enrich- 
ment step or the selection step, as appropriate, and includes the screening of 
the library for display molecules having predetermined desirable characteris- 
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tics. Predetermined desirable characteristics can include binding to a target, 
catalytically changing the target, chemically reacting with a target in a man- 
ner which alters/modifies the target or the functional activity of the target, and 
covalently attaching to the target as in a suicide inhibitor. 

In theory, display molecules of interest can be selected based on their prop- 
erties using either physical or physiological procedures. The method pre- 
fen-ed according to the present invention is to enrich molecules with respect 
to binding affinity towards a target of interest. In a certain embodiment, the 
basic steps involve mixing the library of complexes with the target of interest. 
The target can be attached to a column matrix or microtitre wells with direct 
immobilization or by means of antibody binding or other high-affinity interac- 
tions. In another embodiment, the target and displayed molecules interact 
without immobilisation of the target. Displayed molecules that bind to the tar- 
get will be retained on this surface, while nonbinding displayed molecules in 
a certain aspect of the invention will be removed during a single or a series of 
wash steps. The identifier oligonucleotides of complexes bound to the target 
can then be recovered. It may be considered advantageously to perform a 
chromatography step after or instead of the washing step, notably in cases 
where the target is not immobilized. After the recovery of the identifier oli- 
gonucleotides they are optionally amplified before the decoding step. 

A significant reduction in background binders may be obtained with increased 
washing volumes, repeating washing steps, higher detergent concentrations 
and prolonged incubation during washing. Thus, the more volume and num- 
ber of steps used in the washing procedure together with more stringent con- 
ditions the more efficiently the non-binders and background binders will be 
removed. The right stringency in the washing step can also be used to re- 
move low-affinity specific binders. However, the washing step will also re- 
move wanted binders if too harsh conditions are used. 
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A blocking step, such as incubation of solid phase with skimmed milk pro- 
teins or other inert proteins and/or mild detergent such as Tween-20 and Tri- 
ton X-100. may also be used to reduce the background. The washing condi- 
tions should be as stringent as possible to remove background binding but to 
5 retain specific binders that interact with the target Generally, washing condi- 
tions are adjusted to maintain the desired affinity binders, e.g. binders In the 
micro molar, nanomolar, or picomolar range. 

The present Invention takes advantages of the fact that the identifier oligonu- 
10 cleotides of low-binding complexes will be in a low concentration compared 
to the identifier oligonucleotides of complexes binding with high affinity. The 
generated imbalance can be enhanced in the subsequent fomiation of a mix- 
ture of homo- and hetoro-duplexes and the recovery of homo-duplexes. 

15 The target can be any compound of interest. E.g. the target can be a protein, 
peptide, carbohydrate, polysaccharide, glycoprotein, hormone, receptor, an- 
tigen, antibody, virus, substrate, metabolite, transition state analogue, cofac- 
tor. inhibitor, drug. dye. nutrient, growth factor, cell, tissue, etc. without limita- 
tion Suitable targets include, but are not limited to. angiotensin converting 
20 enzyme, renin, cyclooxygenase. 5-lipoxygenase. IIL- 1 0 converting enzyme, 
cytokine receptors, PDGF receptor, type II inosine monophosphate dehydro- 
genase, p-lactamases. Integrin. proteases like factor Vila, kinases like Bcr- 
Abl/Her. phosphotases like PTP-1B. and fungal cytochrome P-450. Targets 
can include, but are not limited to. bradykinin. neutrophil elastase, the HIV 
25 proteins, including tat, rev, gag, int. RT. nucleocapsid etc., VEGF. bFGF. 

TGFP. KGF. PDGF. GPCR, thrombin, substance P. IgE, sPLA2. red blood 
cells, glioblastomas, fibrin clots. PBMCs. hCG. lectins, selectins. cytokines, 
ICP4. complement proteins, etc. 

30 A target can also be a surface of a non-biological origin, such as a polymer 
surface or a metal surface. The method of the invention may then be used to 
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identify suitable coatings for such surfaces. 

In a preferred err^bodiment. the desirable display molecule acts on the target 
without any interaction between the nucleic acid attached to the desirable 
encoded molecule and the target. In one embodiment, the bound complex- 
target aggregate can be partitioned from unbound complexes by a number of 
methods. The methods include nitrocellulose filter binding, column chroma- 
tography, filtration, affinity chromatography, centrifugation. and other well 
known methods. A preferred method is size-exclusion chromatography. 

Briefly, the library of complexes is subjected to the target, which may include 
contact between the library and a column onto which the target is immobi- 
lised. Identifier oligonucleotides associated with undesirable display mole- 
cules. i.e. display molecules not bound to the target under the stringency 
conditions used, will pass through the column. Additional undesirable display 
molecules (e.g. display molecules which cross-react with other targets) may 
be removed by counter-selection methods. Desirable complexes are bound 
to the column. The target may be immobilized in a number of ways. In one 
embodiment, the target is immobilized through a cleavable physical link, such 
as one more chemical bonds. 

The complex may be provided with a cleavable linker at a position between 
the display molecule and the identifier oligonucleotide. When the target is 
immobilized, the cleavable linker of the complex is preferable orthogonal to 
the cleavable linker that attached the target to the solid support. The cleav- 
able linker may be cleaved to separate the identifier oligonucleotides of com- 
plexes having affinity towards the targets. Just to mention a single type of 
orthogonal cleavable linkages, one could attached to target to the solid sup- 
port through a linkage that can be cleaved by a chemical agent, and the 
linker separating the display molecule and the identifier oligonucleotide may 
be selected as a photocleavable linkage. More specifically, the former linkage 
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may be a disulphlde bond that can be cleaved by a suitable reducing agent 
like DTT (dithiothreitol) and the latter linkage may be an o-nitrophenyl group. 

There are other partitioning and screening processes which are compatible 
with this invention that are known to one of ordinary skill in the art. Such 
known process may be used In combination with the present inventive 
method. In one embodiment, the complex-target aggregate can be fraction- 
ated by common methods and then each fraction Is assayed for activity. The 
fractionization methods can include size, pH. hydrophobicity. etc. 



Inherent in the present method is the selection of encoded molecules on the 
basis of a desired function: this can be extended to the selection of mole- 
cules with a desired function and specificity. Specificity can be required dur- 
ing the selection pnDcess by first extracting complexes which are capable of 
1 5 Interacting with a non-desired "target" (negative selection, or counter- 
selection), followed by positive selection with the desired target. As an exam- 
ple, inhibitors of fungal cytochrome P-450 are known to cross-react to some 
extent with mammalian cytochrome P-450 (resulting In serious side effects). 
Highly specific inhibitors of the fungal cytochrome could be selected from a 
20 library by first removing those complexes capable of interacting with the 
mammalian cytochrome, followed by retention of the remaining products 
which are capable of Interacting with the fungal cytochrome. 



In a certain embodiment, a binding platform may be constructed that can be 
used for almost any target. The binding platform should preferably be small 
enough to only allow association of a few or a single target molecule. This to 
ensure a solution based selection procedure with adjustable target concen- 
tration. The binding platform is primarily composed of two components; a 
small surface allowing association of the target molecule, and an association 
area/site for the target oligonucleotide. This binding platform may be de- 
signed to mediate the association of the target and target oligonucleotide to 
allow proximity selection in solution. 
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r.iftavable linkers 

A cleavable linker may be positioned between the target and a solid support, 
between the display molecule and the identifier oligonucleotide, or any other 
position that can provide for a separation of the identifier oligonucleotides of 
successful complexes from non-specific binding complexes. The cleavable 
linker may be selectively cleavable. i.e. conditions may selected that only 
cleave that particular linker. 

The cleavable linkers may be selected from a large plethora of chemical 
structures. Examples of linkers include, but are not limited to. linkers having 
an enzymatic cleavage site, linkers comprising a chemical degradable com- 
ponent, and linkers cleavable by electromagnetic radiation, such as light. 



1 5 Examples of linkers cleavable by electromagnetic radiation (light) 



o-nitrobenzyl 



NO2 




X-T "y=o 



p-alkoxy 
P r2 
o 

hv 



20 



25 



O-nitrobenzyl in exo position 
NO2 

For more details see Holmes CP. J. Org. Chem. 1997. 62. 2370-2380 



3-nitrophenyloxy 
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O2N h 

For more details see Rajasekharan Plllal, V. N. Synthesis. 1980. 1-26 



Dansyl derivatives: 

/I 6 

o=s=o 




For more details see Rajasekharan Pillai. V. N. Synthesis. 1980. 1-26 



Coumarin derivatives 

hv h-nr2r3 



nr2r3 



H-Donor 

For more details see R. O. Schoenleber. B. Giese. Synlett 2003. 501-504 

R^ and R^ can be either of the potential drug candidate and the identifier oli- 
gonucleotide, respectively. Alternatively. R^ and R^ can be either of the target 
or a solid support, respectively. 
r3 = H or OCH3 

If X Is O then the product will be a carboxylic acid 
If X is NH the product will be a carboxamide 

One specific example is the PC Spacer Phosphoramidite (Glen research 
catalog # 10-4913-90) which can be introduced in an oligonucleotide during 
synthesis and cleaved by subjecting the sample in water to UV light (~ 300- 
350 nm) for 30 seconds to 1 minute. 
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DMT = 4,4'-Dimethoxytrityl 
iPr = Isopropyl 
CNEt = Cyanoethyl 



The above PC spacer phosphoamidite is suitable incorporated in a library of 
complexes at a position between the indentifier and the potential drug candi- 
date. The spacer may be cleaved according to the following reaction. 




O-P-O-R^ 




H 

R1 and r2 can be either of the encoded molecule and the identifying mole- 
cule, respectively. In a preferred aspect R^ is an oligonucleotide Identifier and 
the R^ is the potential drug candidate. When the linker is cleaved a phos- 
phate group is generated allowing for further biological reactions. As an ex- 
ample, the phosphate group may be positioned in the 5'end of an oligonu- 
cleotide allowing for an enzymatic ligation process to take place. 

Examples of linkers cleavable by chemical agents: 

Ester linkers can be cleaved by nucleophilic attack using e.g. hydroxide ions. 
In practice this can be accomplished by subjecting the target-ligand complex 
to a base for a short period. 
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r3 r* 




OH 




10 



15 



and r2 can be the either of be the potential drug candidate or the Identifier 
oligonucleotide, respectively. R^^ can be any of the following: H. CN. F. NO2. 
SO2NR2. 

Disulfide linkers can efficiently be cleaved / reduced by Tris (2-carboxyethyl) 
phosphine (TCEP). TCEP selectively and completely reduces even the most 
stable water-soluble alkyi disulfides over a wide pH range. These reductions 
frequently required less than 5 minutes at room temperature. TCEP is a non- 
volatile and odorless reductant and unlike most other reducing agents, it is 
resistant to air oxidation. Trialkylphosphines such as TCEP are stable in 
aqueous solution, selectively reduce disulfide bonds, and are essentially un- 
reactive toward other functional groups commonly found in proteins. 




H2O 



R'-SH * MS-f^ * 



More details on the reduction of disulfide bonds can be found in Kirley, 
T.L.(1989). Reduction and fluorescent labeling of cyst(e)ine-containing pro- 
20 teins for subsequent structural analysis, Anal. Biochem. 180. 231 and Levl- 
son. M.E.. etal. (1969). Reduction of biological substances by water-soluble 
phosphines: Gamma-globulin. Experentia 25. 126-127. 
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Linkers cleavable by enzymes 

The linker connecting the potential drug candidate with the Identifier oligonu- 
cleotide or the solid support and the target can Include a peptide region that 
allows a specific cleavage using a protease. This Is a well-known strategy In 
5 molecular biology. Site-specific proteases and their cognate target amino 
acid sequences are often used to remove the fusion protein tags that facili- 
tate enhanced expression, solubility, secretion or purification of the fusion 
protein. 

Various proteases can be used to accomplish a specific cleavage. The specl- 
10 ficity is especially Important when the cleavage site Is presented together 

with other sequences such as for example the fusion proteins. Various condi- 
tions have been optimized in order to enhance the cleavage efficiency and 
control the specificity. These conditions are available and know in the art. 

15 Enterokinase is one example of an enzyme (serine protease) that cut a spe- 
cific amino acid sequence. Enterokinase recognition site is Asp-Asp-Asp- 
Asp-Lys (DDDDK). and it cleaves C-terminally of Lys. Purified recombinant 
Enterokinase Is commercially available and Is highly active over wide ranges 
in pH (pH 4.5-9.5) and temperature (4-45*'C). 

20 

The nuclear inclusion protease from tobacco etch virus (TEV) is another 
commercially available and well-characterized proteases that can be used to 
cut at a specific amino acid sequence. TEV protease cleaves the sequence 
Glu-Asn-Leu-Tyr-Phe-Gln-Gly/Ser (ENLYFQG/S) between Gln-Gly or Gln- 
25 Ser with high specificity. 

Another well-known protease is thrombin that specifically cleaves the se- 
quence Leu-Val-Pro-Arg-Gly-Ser (LVPAGS) between Arg-Gly. Thrombin has 
also been used for cleavage of recombinant fusion proteins. Other se- 
30 quences can also be used for thrombin cleavage; these sequences are more 
or less specific and more or less efficiently cleaved by thrombin. Thrombin is 
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a highly active protease and various reaction conditions are known to the 
public. 

Activated coagulation factor FX (FXa) is also known to be a specific and use- 
ful protease. This enzyme cleaves C-teoninal of Arg at the sequence lle-Glu- 
Gly-Arg (lEGR). FXa Is frequently used to cut between fusion proteins when 
producing proteins with recombinant technology. Other recognition se- 
quences can also be used for FXa. 

Other types of proteolytic enzymes can also be used that recognize specific 
amino acid sequences. In addition, proteolytic enzymes that cleave amino 
acid sequences In an un-speclfic manner can also be used if only the linker 
contains an amino acid sequence in the complex molecule. 

Other type of molecules such as ribozymes. catalytically active antibodies, or 
lipases can also be used. The only prerequisite is that the catalytically active 
molecule can cleave the specific structure used as the linker, or as a part of 
the linker, that connects the encoding region and the displayed molecule or. 
In the alternative the solid support and the target. 

A variety of endonucleases are available that recognize and cleave a double 
stranded nucleic acid having a specific sequence of nucleotides. The en- 
donuclease Eco Rl is an example of a nuclease that efficiently cuts a nucleo- 
tide sequence linker comprising the sequence GAATTC also when this se- 
quence is close to the nucleotide sequence length. Purified recombinant Eco 
Rl is commercially available and is highly active in a range of buffer condi- 
tions. As an example the Eco Rl is working in in various protocols as indicted 
below (NEBuffer is available from New England Blolabs): 
NEBuffer 1 : [10 mM Bis Tris Propane-HCl. 10 mM MgCI2. 1 mM dithfothreitol 
(pH 7.0 at aS^C)], 

NEBuffer 2 : [50 mM NaCI, 10 mM Tris-HCI. 10 mM MgCI2. 1 mM dithiothrel- 
tol (pH 7.9 at 25*'C)1, 
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NEBuffer 3 : [100 mM NaCI. 50 mM Tris-HCI. 10 mM MgCI2. 1 mM dithio- 
threitol (pH 7.9 at 25''C)], 

NEBuffer 4 : [50 mM potassium acetate. 20 mM Tris-acetate, 10 mM magne- 
sium acetate, 1 mM dithiothreitol (pH 7.9 at 25'*C)]. 

Extension buffer : mM KCI. 20 mM Tris-HCKPh 8.8 at 25o C). 10 mM (NH4 )2 
S04 . 2 mM MgSO 4 and 0.1% Triton X-100, and 200 mM dNTPs. 

Formation of homo -duolexes 

After the identifier oligonucleotides of complexes comprising display mole- 
cules interacting in a certain desired fashion with a target are partitioned, a 
nucleic acid amplification is usually conducted. The nucleic acid amplification 
method may be PCR or a method aquivalent thereto. The amplification is 
preferably conducted such that the relative destribution of the Individual iden- 
tifier oligonucleotides are retained. According to a certain embodiment, iden- 
tical PCR priming sites to obtain a proportional amplification of the identifier 
oligonucleotides surround the coding section. 

After the formation of duplexes, a denaturing step usually follows. A denatur- 
ing can be obtained by a variety of methods, including increased tempera- 
ture, high salt concentration, presence of organic solvents, certain duplex 
disrupting chemicals, etc. Generally it is preferred to use an elevated tem- 
perature. When the temperature is increased above the melting temperature 
of the duplex the single stranded oligonucleotides are formed. At the elevated 
temperature, single stranded oligonucleotides complementing the partitioned 
Identifier oligonucleotides optionally may be added. 

Homo-duplexes are fomied by subjecting the single stranded mixture to hy- 
bridisation conditions. When the denaturing is obtained by elevating the tem- 
perature, the hybridisation conditions may appropriately be obtained by de- 
creasing the temperature below the melting temperature of the homo-duplex. 
The temperature decrease rate may be adjusted according to the specific 
conditions used to obtain the optimal condition for the formation of the homo- 
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duplexes. If a high temperature decrease rate is selected a lower tendency of 
homo-duplex formation is obtained. Conversely, a low temperature decrease 
rate implies a less tendency of hetero-duplex fomnation. Depending on the 
renaturing conditions used, predominately homo-duplexes or a mixture of 
5 homo- and hetero-duplexes are fomied. When the denaturing Is obtained 
using hybridisation modifying agents, like salt, solvents etc. hybridisation 
conditions may be obtained by desalination in case of salt and evaporation In 
case of solvents. 

10 As mentioned above, the mixture of denatured single stranded oligonucleo- 
tides may be added complementing oligonucleotides to obtain certain desired 
effects. In an aspect of the invention random oligonucleotides are added to 
the single stranded oligonucleotides to lower the general tendency of duplex 
formation. A less tendency may be desirable because the selectivity for the 
1 5 best binding display molecule increases. In another aspect of the invention, 
certain specific complementing oligonucleotides are added to increase the 
probability of duplex formation for certain kind of identifier oligonucleotides. 
The added ollgonucleotides may be immobilized or capable of being immobi- 
lized on a solid support. Alternatively, the addition of certain complementing 
20 oligonucleotides may compensate for an initial library not having an even dis- 
tribution of the concentratton of Individual members. 

In certain applications of the Invention it is prefen-ed to use amplification of 
only one strand, thereby to produce single stranded identifier oligonucleo- 
25 tides. The single stranded identifier oligonucleotides may then be mixed with 
a mixture of oligonucleotides complementing the original oligonucleotides of 
the library. The more frequent abundant single stranded identifier oligonu- 
deotides will be more inclined to form a higher portion of homo-duplex com- 
pared to the less frequently occurring, which will tend to be In single stranded 
30 form or in hetero-duplex form. It may be desired to spike the partitioned frac- 
tion of single stranded identifier oligonucleotides with a higher concentration 
of certain complementing oligonucleotides in order to bias the formation of 
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homo- and hetero-duplexes. Alternatively, certain complementing oligonu- 
cleotides may be added in a lower amount. As an example, the Identifier oli- 
gonucleotide coding for a known ligand may be extinguished because it is 
desired to find unknown ligands. by avoiding the addition of an oligonucleo- 
tide complementing said identifier oligonucleotide. 

In some aspects of the invention, a library of different display molecules, 
each being associated with an identifying oligonucleotide, is divided into two 
or more portions. A first portion may be contacted with a target and the identi- 
fier oligonucleotides of successful display molecules harvested, while a sec- 
ond portion may be contacted with a blank vessel or a second target and the 
identifier oligonucleotides of successful display molecules of the second por- 
tion collected. Prior to the step of denaturing and renaturing. the identifier 
oligonucleotides from the two portions are mixed. The advantages if screen- 
ing two or more portions of a library individually include changing the profile 
of the background, obtaining subtype selectivity of the display molecule etc. It 
is also possible to combine two or more libraries before the contacting with 
the target to obtain an altered profile of binding display molecules. 

Rof-nverv of hom o-duplexes 

The recovery of homo-duplex is typically obtained by removing the hetero- 
duplexes and the single stranded oligonucleotides from the renaturate mix- 
ture. In another aspect of the invention, the homo-duplexes are extracted 
form the mixture. When the hetero-duplexes are removed from the reaction 
mixture, any convenient method can be used, such as a mis-match binding, 
enzymatic or chemical mis-match cleavage, or physical method. 

Mis-match binding 

in a certain aspect of the invention, hetero-duplexes are removed by binding 
to prokaryotic or eukaryotic mis-match binding proteins. An example is MutS. 
a mismatch binding protein isolated from E. coll. which recognises regions of 
double-stranded DNA containing a mismatched base pair (Wagner el al.. 
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1995, Nucleic Acids Research. 22. 1541-1547) as well as 1 to 4 base pair 
insertion-deletion loops. MutS is allowed to bind to the hetero-duplexes and 
bound hetero-dupIex/MutS complexes are removed from the reaction mixture 
using, for example, powdered nitrocellulose. A convenient alternative is to 
use MutS conjugated to magnetic beads, allowing bound heteroduplexes to 
be removed from the reaction mixture with a magnet. MutS may also be con- 
jugated to biotin and the bound hetero-duplexes removed from the mixture 
using streptavidin-coated beads. 

MutY has considerable potential for mismatch detecHon. as its in viVo func- 
tion is to repair mismatched G:A base pairing by cleavage of the adenlne- 
containing strand. Similar proteins thought to be involved in G:T and G:U 
mismatch repair have also been described. Hsu (Carcinogenesis 
1994-15:1657-62) described the use of E. coli MutY protein for the detection 
of mismatched G:A in p53. As low as 1-2% mutant DNA in a sample of mu- 
tant and wild-type DNA could be detected using a synthetic DNA oligonucleo- 
tide to create G/A mis-pairing. 



Enzymatic cleavage 

Chemical and enzymatlcal cleavage methods used for degradation of hetero- 
duplexes must differentially cleave these sequences and retain the homo- 
duplexes. Any sequence difference will result in the formation of a mispairing. 
causing localized distortion of the double helix. Cleavage techniques exploit 
this structural change by selectively degrading or modifying DNA at the site 
of the mismatch. Ideally, little or no cleavage would be seen in a perfectly 
matched DNA fragment, and all distortions of the helix generated by base 
mismatches would result in cleavage. In practice neither criteria a fully met. 
and the utility of a technique becomes a trade-off between ease of use. sen- 
sitivity and specificity. 

in certain aspect of the invention mammalian or bacterial endonudeases are 
used to recognise and cleave the hetero-duplexes at mismatched nucleo- 
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bases (see U.S. Pat. No. 5.824.471). Examples of preferred enzymes include 
bacteriophage resolvases such as T4 endonuclease VII or T7 endonuclease 
I. in a preferred aspect of the invention, thermostable cleavage enzymes 
would be used in order to avoid the necessity of adding fresh enzyme during 
each round of heteroduplex formation and removal. 

An enzyme called "Cleavase" from Third Wave Technologies relies on the 
endonucleolytic cleavage of stem-loop stmctures. The precise nature of the 
thermostable enzyme "Cleavase" has not been published. Since the stem- 
loop profile is dependent upon the primary sequence of the DNA. sequence 
changes in some cases result in a change in the cleavage profile. This 
method is unique in that unlilce all other enzymatic techniques, it does not 
require the formation of a mismatch heteroduplex to generate a site for en- 
zymatic deavage. 

Ribonuclease A cleavage was originally described by Myers et al (Science 
1985: 230:1242-8) using DNA:RNA hybrids. Sensitivity was reported to be 
around 60% per strand cleaved. Grange ef a/ (Nucleic Acids Research 1990; 
18:4227-36) described improved sensitivity by screening both strands of 
RNA. A Non-isotopic RNase Cleavage Assay (NIRCA) has also been de- 
scribed. This assay (commercially available from Ambion) utilized PGR prim- 
ers with phage RNA polymerase promoters so that large quantities of RNA 
were produced. Cleaved products could be detected on agarose gels and 
fragments of up to 1 kilobase were analyzed. Additional RNase enzymes 
such as RNase 1 and RNase T1 increase the sensitivity of the assay. The 
commercial kit includes a helix modifying reagent that makes the mismatches 
more sensitive to cleavage. 

T4 endonuclease VII (T4E7) and T7E1 are small proteins from bacterio- 
phages that bind as homodimers and cleave aberrant DNA structures includ- 
ing Holliday Junctions (and are hence sometimes called "resolvases") though 
it is far from clear that they perform such a role in vivo. Mashal ef at (Nature 
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Genetics 1995; 9:177-83) observed that they preferentially cleaved mis- 
matched hetero-duplexes. leading to the possibility of an enzymatic equiva- 
lent to the chemical cleavage of mismatch. DNA requires no special prepara- 
tion after amplification like GC clamping or including primers with 'phage 
promoters. Background peaks which are seen are highly reproducible and 
may therefore amenable to background subtraction algorithms like those ap- 
plied to DNA sequencing traces. A commercially available T4E7-based muta- 
tion detection kit is available from Amersham-Phannacla. 

A plant endonuclease (CEL I) with similar activity has also been described. 
CEL I is one of series of plant endonucleases with similar activity to nuclease 
S1 but at neutral pH instead of pH 4 or 5. Like T4E7. the cleavage efficiency 
varies according to the mismatch examined and background cleavage is de- 
pendent on the template being examined. 

Uracil glycosylase and photo-activated guanine modification reagent have 
been used to develop a cleavage method that essentially produces T or G 
sequencing tracks. DNA synthesis by PGR requires the incorporation of a 
proportion of uracil bases in place of thymines. These can be removed by 
uracil glycosylase and the abasic site then cleaved by heat or enzymatic 
treatment. 

The present invention also includes any combination of enzymes. 
Chemical method 

Chemical cleavage of mismatches was developed as a modification of the 
Maxam-Gllbert DNA sequencing method by Cotton ef a/ (Proc Natl Acad Sci 
(US) 1988;85: 4397-401). Mismatched thymines are susceptible to modifica- 
tion by osmium tetroxide (or potassium permanganate and tetraethyl ammo- 
nium acetate) and mismatched cytosines can be modified by hydroxylamlne 
The modified bases are then cleaved by hot piperidine treatment. 
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A chemical method can also be used for preventing a PGR amplification to 
be performed. The chemical methods includes DMS-modification, ketoxal- 
modification, DEPT-modification etc. and take advantage of the fact that a 
single stranded oligonucleotide is more volnable to chemical reaction than 
5 the corresponding oligonucleotide in double stranded form. The chemical 
methods usually result in a modification of a nucleobase so that a poly- 
merase cannot recognise the oligonucleotide as a substrate and therefore 
cannot perform a PGR amplification. When a PGR ampiification is performed 
prior to step d only identifier oligonucleotides of homo-duplexes is amplified 
10 while single stranded and hetero-duplexes will be repressed. 

Physical method 

The physical method of recovering homo-duplex molecules involves physical 
separation, such as achieved by chromatography or electrophoresis. Suitable 

15 chromatography methods include column chromatography, affinity chroma- 
tography, size-exclusion chromatography and gel chromatography. Suitable 
gels for gel chromatography include non-denaturing gels, such as agarose or 
polyacrytamide gels. The chromatography can be performed at elevated 
temperatures, e.g. above ambient temperatures, to favour the formation of 

20 homo-duplexes. In a certain aspect of the invention, the temperature is se- 
lected between the average melting temperature of the homo-duplexes and 
the melting temperature of the hetero-duplexes having the least amount of 
mis-matches. Generally, a temperature for performing gel chromatography is 
selected in the range of 45 to 80 degrees Celsius. 

25 

Suitable physical methods notably include denaturing high performance liquid 
chromatography (DHPLC) and chemical or temperature denaturing electro- 
phoresis. Denaturing HPLG is a chromatographic technique capable of sepa- 
rating homo-duplex DNA molecules from a mixture of hetero-duplex and sin- 
30 gle stranded oligonucleotides. The mixture is applied to a stationary reverse- 
phase support and the homo- and hetero-duplex molecules are eluted (under 
thermal or chemical conditions capable of at least partially denaturing hetaro- 
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duplexes) with a mobile phase containing an ion-pairing reagent (e.g. 
triethylammonium acetate; TEAA) and an organic solvent (e.g. acetonitrlle; 
AcN). DHPLC can also allow the direct quantisation of relative homo-duplex 
and hetero-dupiex concentrations by the detection of ultraviolet absorbance 
or fluorescent emission of/from the separated species. The area under the 
absorbance/emission peak is proportional to the amount of product, which 
therefore allows quantitative assessment of the relative proportions. DHPLC 
is described in Liu W et al. (Nucleic Acids Research. 26:1396-1400. 1998 
and O'Donovan MC et al. Genomics. 52:4449. 1998). 

A preferred method for use in the instant invention to separate hetero-dupiex 
and homo-duplex molecules is as described In U.S. Pat. No. 5.795.976. 
which is incorporated herein by reference. 

nftterminina th^ irientifier nliaonucleotide sequence 

The nucleotide sequence of the identifier sequence Is determined to identify 
the Identity of the binding display molecule(s). In a certain embodiment of the 
invention, chemical entities that participated in the formation of the display 
molecules that binds to the target are identified. The synthesis method of the 
display molecule may be established if information on the chemical entities 
as well as the point in time they have been incorporated in the display mole- 
cule can be deduced from the identifier oligonucleotide. It may be sufficient to 
obtain information on the chemical structure of the various chemical entities 
that have participated in the formation of the display molecule to deduce the 
full molecule due to structural constraints during the formation. As an exam- 
ple, the use of different kinds of attachment chemistries may ensure that a 
chemical entity can only be reacted at a single position on a scaffold. Another 
kind of chemical constrains may be present due to steric hindrance on the 
scaffold molecule or the chemical entity to be transferred. In general how- 
ever, it is preferred that information can be infen-ed from the identifier oli- 
gonucleotide sequence that enable the identification of each of the chemical 
entities that have participated in the formation of the encoded molecule along 
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with the point in time in the synthesis history the chemical entities have been 
incorporated in the (nascent) display molecule. 

Although conventional DNA sequencing methods are readily available and 
useful for this determination, the amount and quality of isolated bifunctional 
molecules may require additional manipulations prior to a sequencing reac- 
tion. 

Where the amount is low. it is preferred to Increase the amount of the oli- 
gonucleotide sequences by polymerase chain reaction (PGR) using PGR 
primers directed to primer binding sites present in the identifier oligonucleo- 
tide sequence. 

In one embodiment, the different identifier oligonucleotide sequences are 
cloned Into separate sequencing vectors prior to determining their sequence 
by DNA sequencing methods. This is typically accomplished by amplifying 
the different Identifier oligonucleotide sequences by PGR and then using a 
unique restriction endonuclease sites on the amplified product to directionally 
clone the amplified fragments into sequencing vectors. The cloning and se- 
quencing of the amplified fragments then Is a routine procedure that can be 
carried out by any of a number of molecular biological methods known in the 
art. 

Alternatively, the bifunctional complex or the PGR amplified identifier oli- 
gonucleotide sequence can be analysed in a microarray. The array may be 
designed to analyse the presence of a single codon or multiple codons in a 
Identifier oligonucleotide sequence. 

Another approach, the identifier oligonucleotide product is analysed by 
QPGR. Preferably, the QPGR affords information as to the chemical moieties 
that has participated in the formation of the display molecules. The QPGR 
approach also allows a direct investigation of the enrichment factor If two 
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samples are analysed fn parallel, said samples being collected before and 
after the use of the present method. Various conditions can be investigated 
to obtain the most optimal selection procedure before the sequences are 
analysed to identify the precise structures of the binding molecules. 

5 

Sequencing can also be performed using pyrosequencing chemistry. A still 
further method for decoding the identifier oligonucleotides comprises high 
throughput sequencing, using single molecule approach. 

10 According to the invention, the homo-duplexes are used to deduce the iden- 
tity of the display molecule(s) interacting with a target. The homoduplexes 
may be used directly from the recovery step or modified by any biotechno- 
iogicat technique prior to decoding. Notably, the modification can include total 
or partial amplification of tiie homo-duplexes to produce a double stranded or 

15 single stranded product. Also the modification may include fragmentation, 

e.g. digestion by a restriction enzyme or another nucleic acid active enzyme. 
In a certain embodiment, a restriction site is positioned between codons to 
allow for a separation of codons of the Identifier oligonucleotides. The frag- 
mentation may facilitate the subsequent decoding, as small nucleic acids 

20 usually are easier to decode. 

The recovered duplexes of step e may one or more times be recycled to step 
d. The recycling may reduce the diversity at the nucleic acid level and at the 
same time increase the probability that an identifier oligonucleotide from a 

25 display molecule interacting with a target is identified when sequencing a lim- 
ited number of homo-duplexes. Usually, when two or more sequences of a 
single identifier are detected during the sequencing, this is an indication that 
a display molecule performing an interaction with the target is identified. The 
repetition of the hetero- and homo-duplexes formation and recovery of the 

30 homo-duplexes may be conducted a suitable number of times until two or 
more identifier oligonucleotides in a sequencing step of a limited number of 
homo-duplexes is revealed. 
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10 



In a certain embodiment of the invention, tlierefore, a decoding of the homo- 
duplex is effected prior to a consecutive repetition. The information of the 
decoding step may be used to modify the composition of the identifier oli- 
gonucleotides and strands complementary thereto. Notably, the composition 
can be modified by removing certain identifier oligonucleotides from the pool. 
As an example, Identifiers of known display molecules interacting with a tar- 
get can be removed to reveal other display molecules in the library having an 
ability to perform the same interaction albeit, to a lesser extend. 



Various methods for excluding certain identifier oligonucleotides are avail- 
able, including removal by an immobilized probe, digestion with a sequence 
specific nuclease etc. An appealing method includes that the partitioned frac- 
tion of Identifier oligonucleotides is SfDlit in two portions. A first portion is then 

15 treated as described above to recover homo-duplexes. The recovered homo- 
duplexes are used to treat the second portion. In a certain aspect of the in- 
vention It IS preferred to amplify the recovered homo-duptexes using primers 
conjugated to biotin. When the amplification product containing biotin is 
mixed with the second portion of the partitioned identifiers under denaturing 

20 conditions and subsequently subject to at least partly renaturing conditions, 
the strands with attached biotin may anneal with a complement in the mix- 
ture. A subsequent treatment with streptavidin or avidin allows the biotin la- 
belled duplexes selectively to be removed from the mixture, 

25 

BREIF DESRIPTION OF THE DESCRIPTION 

Fig. 1 discloses a schematic representation of a selection process. 
Fig. 2 depicts a homo- and a hetero-duplex. 

Fig. 3 shows a library treated in accordance with the method of the invention. 
30 Fig. 4 discloses the overall principle of mis-match selection. 
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DETAILED DISCLOSURE OF THE FIGURES 

Figure 1 discloses the common selection process fn schematic form. In dia 
gram A an ideal library is shown that comprises a variety of different mem- 
bers of a library (Diversity) in the same concentration, i.e. with the same 
5 number of each bifunctional complex in the mixture. 



The library represented in diagram A is subjected to a selection process. 
Generally, the selection process implies that the library is contacted with a 
target to allow for an interaction, usually a binding interaction, to take place. 

10 Ideally, the non-binding members of the library are discharged and the bi- 
functional complexes able to perform an interaction with the target are se- 
lected. However, in reality a certain amount of non-binding complexes are 
eluted together with the binding complexes. The non-binding complexes are 
referred to a background in diagram B. The background usually increases 

15 with the diversity, i.e. a large library generates a higher background relative 
to small library. 



To increase the probability of finding a ligand, it is generally desired to apply 
a library as large as possible. The high background however generates a 

20 high level of noise so that a detection of the hits is difficult or even impossi- 
ble. The present invention suggests a method to reduce the background so 
as to be able to identify the hit. In diagram C the background is broken down 
to individual molecules to illustrate that the amount of the binding ligand is 
higher than each of the molecules in the selected library. The imbalance 

25 formed due to the selection process between the hit and the remainder of the 
library members is then used in the subsequent steps of the present method. 



Fig. 2 schematically discloses a homo-duplex and a hetero-duplex. A homo- 
duplex is an identifier oligonucleotide hybridised to a fully complementing 
30 oligonucleotide. A hetero-duplex is an Identifier oligonucleotide hybridised to 
an oligonucleotide showing one, two, or more mis-matching nucleotides. A 
mis-matching nucleotide is a nucleotide not paired in accordance with the 
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Watson-Crick base-pairing rules, in which A pairs with T (or U) and C pairs 
with G. In Fig. 2, the mis-matching nucleotide(s) are illustrated by a filled cir- 
cle. 

5 The homo- and hetero-duplexes may be formed by denaturing a PGR prod- 
uct of the result of the selection shown on Fig. 1 and subsequently allow the 
mixture to hybridise again. The denaturing is usually performed by heating to 
a temperature above the melting point of the PGR product. The hybridisation 
conditions are usually obtained by lowering the temperature well below the 
10 melting point of the homo-duplexes. 

Fig. 3 illustrates various steps of the present invention. In a first step the li- 
brary is subjected to a selection process as described in Fig. 1 , in which the 
straight line depicts the identifier oligonucleotide from the complex of the 

15 binding display molecule. The output of the selection process is in a subse- 
quent step subjected to a melting and reannealing step. Due to the excess of 
identifier oligonucleotides from the binding display molecule, the mixture of 
homo- and hetero-duplexes comprises a relatively higher content of homo- 
duplexes from the identifiers of the binding display molecules than homo- 

20 duplexes from other sources. In the final step shown on Fig. 3, the homo- 
duplexes are separated from the mixture, e.g. by cleavage of hetero- 
duplexes with an enzyme. The result is an enrichment of identifier oligonu- 
cleotides from binding display molecules. 

25 Fig. 4 discloses a diagram of the overall principle of mis-match selection. Ini- 
tially, a library of bifunctional molecules is subjected to a selection process. 
According to the broadest scope of the invention, any type of bifunctional 
complex library can be used, including phage display, ribosome display, and 
small molecule display. Following the selection, the Identifier oligonucleotide 

30 is amplified using PGR or a similar method in order to generate more copies 
of the individual identifier oligonucleotides. Preferably, the amplification re- 
tains the proportion between the individual identifier oligonucleotides. 
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A number of times the following cycle can be repeated: The PGR ampllcons 
are heated to a temperature above the melting point of the homo-duplexes 
and subsequent cooled to form a mixture of hetero-duplexes and homo- 
duplexes. The homo-duplexes are recovered from the mixture by i) binding 
the mis-matched duplexes to a protein, such as MutS. ii) cleaving the hetero- 
duplex using an appropriate enzyme, such as Gel I. or Hi) physical means, 
such as DHPLC. If the result of the mis-match selection still comprises too 
much noise, i.e. It is not possible when sequencing a small amount of se- 
quences to deduce one or more sequences, which occurs more fi-equently 
than others, the cycle may be repeated, starting with a PGR amplification of 
the output of the mis-match selection. 

The final step of the method includes an analysis of the output. A variety of 
techniques are available to the skilled person in the art. Including sequencing 
using capillary electrophoresis, bead anray. high-density microan-ay etc. If the 
mis-match selection has been successful, an analysis of a relatively few oli- 
gonucleotides will reveal which display molecules of the library that have the 
highest binding affinity, because the Identifier oligonucleotides of complexes 
having display molecules with high binding affinity occurs more frequent 

EXAMPLES 

FXAMPLE 1 Statistical r=.<r..ifltions of mismatch selprtion (MISE) treatment 
in a library after initjal selection. 

Thfioretical procedure: 

This example describes the statistical calculation of MISE treatment simulat- 
ing various libraries with certain size and diversity and different enrichment 
factors in the initial selection process. Below is definition of the parameters 
used in the calculations. 
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Steps (x): 

1 : initial library 

2 : library after selection 

3 : library after first MISE round 

4 : library after second MISE round 

5 : library after tliird MISE round 

6 : library after fourth MISE round 

Total number of molecules in the library in step x 
Total number of molecules in pool p in step x 
Total number of molecules formed in PCR amplification in step x 
Diversity (number of different molecules In library) in step x 
Diversity of pool p (number of different molecules in library) in 

Relative enrichment factor of pool p 
Fraction of homoduplexes surviving MISE treatment 
Fraction of heteroduplexes surviving MISE treatment 



N(t,x) : 
N(p.x) : 
A(t,x) : 
D(t,x) : 
D(p,x) : 
step X 

R(P): 
SO(x) : 
SE(x) : 



STEP 1: Selection. 

In this step the initial library is contacted with target. Unbound templates are 
washed off and bound templates are recovered. The templates will be recov- 
ered according to the binding efficiency of their displayed ligand to the target. 

N(p,1) = N(p.s)*R(p.s) 

STEP 2: Calculation of diversity after selection. 

The selection step can reduce diversity of the library if for some of the back- 
ground binders less than 1 molecule survives selection. 
If N(p.1) is less than D(p.s) then N(p.1) is set equal to D(p.s) [loss of diver- 
sity] 



STEP 3: PCR Amplification of selection output. 



By addition of a known amount of primers in the PGR amplification, the total 
amount of molecules in the library after amplification can be normalized to a 
specific amount [A(t,1)] 

After amplification N(t,1)[post-ampiification] equals A(t,1) 

5 And N(p,1)[posl-ampUfication] equals N(p,1)[pre-amplification] * A(t,1) / N(t,1 )[pre-amplificationJ 

STEP 4: Homo- and hetero-duplex separation. 

The resulting duplexes are denatured by heating and the individual strands 
are reannealed in a random fashion. Thus the re-formation of duplexes takes 
10 place stochastically. 

The frequency of a species in the total library is calculated by 
F{p.1) = N(p.1)/N(t,1) 

Since all species in a pool behave in the same fashion in this simulation, the 
15 frequency is equal for all species in a pooL 

The number of homoduplexes formed in pool [p] is 
0(p.1) = lN(t,1) * (F(p,1))'^2 * D(p.1) 1 / 2 

The number of heteroduplexes involving strands from pool [p] is 
E(p,1) = 2 * F(p.1) * (1-F(p.1)) * D(p.1) * N(t.1) 



STEP 5: Homo- and hetero-duplex separation. 

All duplexes (homo- and heteroduplexes) are reacted with a mismatch- 
specific enzyme that specifically degrades heteroduplexes. It is expected that 
25 a small fraction of homoduplexes is also degraded by the mismatch-specific 
enzyme. 

The number of surviving homoduplexes in pool p is 
O(p.1)-S0(1) 

30 If 0{p,1 ) is then less than zero 0(p,1 ) Is set to zero [less than one molecule 
exists] 
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The number of surviving heteroduplexes in pool p is 
E(p.1) = SE (1) 

If 0(p,1) is then less than zero 0(p,1) is set to zero 
STEP 6: Amplification 

The total library is normalized to N(t.2) corresponding to a PGR step. 
Fraction in pool p of number of total molecules: 

Calculat^'i eyamo/e A 

A model library composed of 5 pools (b-f) that contain species [pool b con- 
tains 16 species, pool c contains 8 species etc.) that bind target specifically 
and can be enriched from 0.1 fold (pool f) to 6.25e-3 fold (pool b). Pool a con- 
tains 1e8 species (minus the sum of the other pools) that bind unspecifically 
and therefore are depleted relatively 'from the library during selection: R(a) - 
1e-7. In this case the selection step reduces the number of molecules in pool 
a from approximately 1e1 1 to lei rR(a) = 1e4 and the number of molecules 
in pool f from 1000 to 100. 



N(t.x) = 1e11 : Total number of molecules in the library in step x 

A(t!x) = lei 1 : Total number of molecules formed in PGR amplification in 

stepx 

D(t.x) = lea : Diversity (number of different molecules in library) in step x 
D(a 1) = lea, D(b.1) = 16, D(c,1) = 8. D(d.1) = 4. D(e.1) = 2. D (f,1) = 1 
R(a) = 1e-7. R(b) = 6.25e-3. R(c) = 1.25e-2. R(d) = 2.5e-2. R(e) = 56-2, R (f) 
= 1e-1 

SO(x) = 0,8 : Fraction of homoduplexes surviving MISE treatment 
SE(x) = le-4 : Fraction of heteroduplexes surviving MISE treatment 



Table 1.1. Fraction of molecules in pool p in step x [N(p.x) divided by N(t.x)l 
after each amplification step. (Numbers in percent). 
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Step Pool (p) 



(X) 







a 


b 


c 


d 


e 


f 


Zw 


1 




99,999969 


0,000016 


0.000008 


0,000004 


0,000002 


0,000001 


0,000031 


2 




95,238095 


0.952381 


0.952381 


0.952381 


0.952381 


0.952381 


4.761905 


3 




63.670137 


1,558650. 


2.656209 


4,851337 


9.241583 


18,022084 


36.329863 


4 




0.529398 


0,050570 


0.251337 


1.591524 


11.378731 


86.198439 


99,470602 


5 




0.000177 


0.000019 


0.000189 


0,008976 


0,867189 


99.123449 


99,999823 


6 




0,000000 


0,000000 


0.000000 


0.000002 


0.004046 


99.995951 


100.000000 



As can be seen the library consisting of > 95 % background binders (poof a) 
after selection (step 2) consists of only 0,5 % background binders and 99 % 
binders (the sum of pools b-f [Zw]) after selection and 2 rounds of MISE 
5 (steps 3 and 4) using the described conditions. 

Calculated example B (lower enrichment factor) 

A model library composed of 5 pools (b-f) that contain species [pool b con- 
tains 16 species, pool c contains 8 species etc.] that bind target specifically 

10 and can be enriched from 0,01 fold (pool f) to 6,25e-4 fold (poo! b). Pool a 
contains 1e8 species (minus the sum of the other pools) that bind unspecifi- 
cally and therefore are depleted relatively from the library during selection: 
R(a) = 1e-7. In this case the selection step reduces the number of molecules 
in pool a from approximately 1e1 1 to 1e1 1*R(a) = 1e4 and the number of 

15 molecules in pool f from 1000 to 10, 

N(t,x) = 1e11 : Total number of molecules in the library in step x 

A(t,x) = lei 1 : Total number of molecules formed in PGR amplification in 

step X 

20 D{t,x) = 1e8 : Diversity (number of different molecules in library) in step x 
D(a.1) = 1e8, D(b,1) = 16, D(c.1) = 8. D(d,1) = 4, D(e.1) = 2, D (f.1) = 1 
R(a) = 1e-7, R(b) = 6,25e-4, R(c) = 1,25e-3, R(d) = 2.5e-3. R(e) = 5e-3. R (f) 
= 1e-2 
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0,8 : Fraction of homoduplexes surviving MISE treatment 
1e-4 : Fraction of heteroduplexes surviving IS/IISE treatment 



Table 1 .2. Fraction of molecules in pool p In step x [N(p.x) divided by N(t.x)] 
after each amplification step. (Numbers in percent). 



1 r 







a 


b 


c 


d 


e 


f 




1 




99.999969 


0,000016 


0.000008 


0,000004 


0.000002 


0.000001 


0,000031 


2 




99.502488 


0.099502 


0.099502 


0.099502 


0.099502 


0,099502 


0,497512 


3 




99.088563 


0.099085 


0,106137 


0,141386 


0.211904 


0,352926 


0,911437 


4 




95,128061 


0.095119 


0,111690 


0.234614 


0.763023 


3.667493 


4,871939 


5 




,19.132181 


0,019130 


0,025357 


0,114356 


1,807232 


78.901744 


80.867819 


6 




0,008267 


0.000008 


0.000011 


0.000098 


0.026939 


99,964675 


99.991733 



As can be seen the library consisting of > 99.5 % background binders (pool 
a) after selection (step 2) consists of only 19 % background binders and > 80 
% binders (the sum of pools b-f [Sb-f]) after selection and 3 rounds of MiSE 
(steps 3,4 and 5) using the described conditions. 

n^lnulatBd e^^mnif^ C (E m nt nfmore efnnient mismatch cleavage) 
A model library composed of 5 pools (b-f) that contain species [pool b con- 
tains 16 species, pool c contains 8 species etc.] that bind target specifically 
and can be enriched from 0.1 fold (pool f) to 6.25e-3 fold (pool b). Pool a con- 
tains 1e8 species (minus the sum of the other pools) that bind unspecifically 
and therefore are depleted relatively from the library during selection: R(a) = 
1e-7. In this case the selection step reduces the number of molecules In pool 
a from approximately 1e11 to lei 1*R(a) = 1e4 and the number of molecules 
In pool f from 1000 to 100. 

N(t,x) = lell : Total number of molecules in the library in step x 
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A(t.x) = 1e11 : Total number of molecules formed in PGR amplification in 
step X 

D(t,x) = 1e8 : Diversity (number of different molecules in library) in step x 
D(a,1) = 1e8, D(b,1) = 16. D(c,1) = 8. D(d.1) = 4. D{e,1) = 2, D (f,1) = 1 
5 R(a) = 1e-7. R(b) = 6,25e-3. R(c) = 1 ,25e-2. R(d) = 2.5e-2, R(e) = 5e-2. R (f) 
= 1e-1 

SO(x) = 0,8 : Fraction of homoduplexes surviving MISE treatment 
SE(x) = 1e-5 : Fraction of heteroduplexes surviving MISE treatment 

10 Table 1,3. Fraction of molecules in pool p In step x [N(p,x) divided by N(t,x)] 
after each amplification step. (Numbers in percent). 



step (x) Pool (p) 







a 


b 


c 


d 


e 


f 




1 




99.999969 


0.000016 


0,000008 


0.000004 


0,000002 


0,000001 


0,000031 


2 




95.238095 


0.952381 


0.952381 


0.952381 


0,952381 


0,952381 


4.761905 


3 




39.292097 


2.026837 


3,971975 


7.862268 


15,642835 


31,203988 


60,707903 


4 




0,022674 


0.023500 


0,177898 


1,388845 


10,985145 


87,401938 


99,977326 


5 




0,000001 


0,000001 


0.000057 


0.006308 


0,783933 


99.209701 


99,999999 


6 




0,000000 


0.000000 


0,000000 


0,000000 


0,003142 


99.996858 


100.000000 



As can be seen the library consisting of > 95 % background binders (pool a) 
15 after selection (step 2) consists of only < 40 % background binders and > 60 
% binders (the sum of pools b-f [It>.fl) after selection and 1 round of MISE 
(step 3) using the described conditions. 

These theoretical examples show that one can extract specific oligonucleo- 
20 tides from a diverse pool based on formation of homo- and hetero-duplexes. 
These examples also illustrate that parameters such as diversity, enrichment 
factor, library size and degree of separation of homo- and hetero-duplexes 
will influence the outcome of the mismatch selection treatment. 
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EXAMPLE 2. General experimental procedure and material for sinaie codon 
oligonucleotides 

Double stranded DNA species are formed by extension of oligonucleotide 
primer F on single stranded templates No and N12 using Sequenase 2.0 (Am- 
5 ersham Biosciences) and buffer. 



Heat-denaturation and annealing of samples: 

A sample of either double stranded No or double stranded Ni2or a mixture of 
these in 5 |jl of water containing 40 mM HEPES pH 7.5. 50 mM NaCI, 16 mM 
MgCb was heated to 95°C for 10 minutes and allowed to anneal by lowering 
the temperature. 



If only double stranded No is present in the sample, this treatment is expected 
to result in the re-formation of homoduplexes. 

If only double stranded N12 (4'^12 = 1 ,7*10^ different species) is present In the 
sample, this treatment is expected to result in the re-formation of homodu- 
plexes and the formation of heteroduplexes. 

If both double stranded No and double stranded Ni2is present in the sample, 
this treatment is expected to result in the re-formation of No and N12 homodu- 
plexes and the formation of Ni2-Ni2and N0-N12 heteroduplexes (containing 1 
or more mismatched base pairs). 



Treatment of samples with mismatch-specific enzyme: 
To a sample consisting of either double stranded No or double stranded N12 
or a mixture of these was added 1 pi (lOx) Surveyor reaction buffer, 0.5 pi 
Enhancer, 1 pi Surveyor Nuclease, and 2.5 pi H20. The samples were then 
mixed and incubated 1 hour at 42°C. Then the samples were heated to 95°C 
for 10 minutes and allowed to anneal by lowering the temperature. To 3 pi of 
each sample was added 1 pi 5xEXT buffer [100 mM HEPES pH 7.5, 750 mM 
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NaCI, 40 mM MgCb], 4 pi H2O, and 1 jJl E. coli Exonuclease VH (10 U/|jl) 
(USB). The samples were then incubated at 37°C for 16 hours. 



QPCR (quantitative real-time polymerase chain reaction) analyses to esti- 
5 mate number of molecules of each species before and after treatment. The 
QPCR was performed using a Taqman probe which is a fluorescence reso- 
nance energy transfer (FRET) probe consisting of a short oligonucleotide 
complementary to one of the amplified strands. The probe contains a fluoro- 
for and a quencher molecule at the 5' and 3' end of the probe, respectively. 

10 This probe is included in the real-time PGR reaction along with the required 
forward and reverse PGR primers. The quencher molecule quenches the 
fluorescence of the fluorofor due to its close proximity on the probe. As the 
Taq polymerase replicates the new strand of the DNA, Its 5'-3' exonuclease 
activity degrades the FRET probe from the 5 -end. This degradation releases 

15 the reporter flurofor from its proximity to the quencher, resulting in fluores- 
cence of the reporter. Accumulation of fluorescence as a result of target am- 
plification was detected in real time in an ABI 7900 HT sequence detection 
system (Applied Biosystems) which contains an optical detection systems 
During the exponential amplification phase, the amount of target should be 

20 doubling every cycle. Quantification analyses use the Cj value, which is the 
point (cycle number) at which the fluorescence signal reaches a specific 
threshold level of detection In the exponential phase. The more abundant the 
template, the earlier this point is reached. The quantity of DNA in the sample 
can be obtained by interpolation of its Ct value vs. a linear standard curve of 

25 Gt values obtained from a serially diluted standard solution. 

Q-PCR reactions 

For 5 ml premix (for one 96-well plate): 

2.5 ml Taqman Universal PGR Master Mix (Applied Biosystems) 
30 450 pi RPv2 (10 pmol/pl) 

25 pi Taqman probe (50 pM) 
1075 pi H2O 



40.5 Ml premix was aliquoted into each well and 4.5 yl of relevant upstream 
PGR primer (Primer F for standard curve and N12) or the No specific primers 
QP1-3 and 5 pi sample (H2O In wells for negative controls) was added. 



•8 



The samples for the standard curve was prepared by diluting Temp4 to 10 
copies/5 Ml and subsequently performing a 10-fold serial dilution of this sam 
pie. 5 Ml was used for each Q-PCR reaction. 

Thermocycling/measurement of fluoresence was performed on an Applied 
Biosystems ABI Prism 7900HT real-time instrument utilizing the cycling pa- 
rameters: 
95''C 10 min 
40 cycles of 
95°C 15 sec 
64°C 1 min 



Oligonucleotides 
N12 oligo: 

5'-GTCAGAGACGTGGTGGAGGAAGTCTTCCTAGAAGCTGGA 
NNNNN 



NNNNNNNTCTAGCAGCTAGTATGAGGTGGTGTCCAAGCTG-3' 



No oligo: 

5'-GGTAGAGACGTGGTGGAGGAAGTGTTCCTAGAAGCTGGA 
TATCTTGAGTTGTi 



GGACTGGTGAGTATGAGGTGGTGTCGAAGGTG-3' 



Primer F: 

5'- CAGGTTGGAGAGGAGGTGATAG -3' 
Primer R: 

5'- GTGAGAGAGGTGGTGGAGGAA-3* 



QP1-3: 



1 
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5'-TCATACTCAGGAGTCGAGAACTGAAGATA-3' 

Temp4: 
5'- 

5 GCTAGAGACGTGGTGGAGGAAGTGTTCCTAGAAGCTGGATATCT- 
GACGTGTTGAC 

GTACACAGTATGACGTGGTGTCCAAGCTG-3' 
TaqMan probe: 

10 5'-6FAM-TGCAGCTTCTAGGAAGAC-MGB-NFQ (Applied Biosystems) 
6FAM : 6-Carboxyfluorescein 
MGB : Minor groove binder 
NFQ : Non-fluorescent quencher 

15 

EXAMPLE 3: Discrimination between homo- and heteroduolexes in an identh 
fier oligonucleotide containing a codon. 

This example shows the possibility to specifically remove/degrade/separate 
heteroduplexes from homoduplexes. This Is illustrates using a mismatch 
20 cleavage enzyme but other techniques such as for example gel separation, 
mismatch binding and column separation can also be used. 

Two samples were subjected to the experimental procedure described in Ex- 
ample 2: 

25 Sample 2A: 1 pmol double stranded No in 5 pi of water containing 40 mM 
HEPES pH 7.5, 50 mM NaCI, 16 mM MgCk 

Sample 28: 1 pmol double stranded Ni2in 5 pi of water containing 40 mM 
HEPES pH 7.5, 50 mM NaCI, 16 mM MgCb 

30 Results of QPGR analyses of sample 2A: 

Number of homoduplexes before MISE treatment: 6, 71 E+08 (6, 

OOE+08 expected) 
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Number of homoduplexes after MISE treatment: 4, 54E+08 
Thus (4, 54 / 6, 71) = 67 % of homoduplexes have survived the MISE treat- 
ment 

5 Results of QPCR analyses of sample 2B: 

Number of heteroduplexes before MISE treatment: 7, 35E+08 

(6,00E+08expected) 

Number of heteroduplexes after MISE treatment: 1 , 1 6E+07 

Thus (1 , 16 y 73, 5) = 1 , 5 % of homoduplexes have survived the MISE 
10 treatment 

That Is the relative survival factor of homoduplexes Is (67 / 1 , 5) = 45 

The relative survival factor obtain in this example is dependent on the ex- 
15 perimental conditions. This can be future optimized if required by tuning the 
mismatch treatment conditions. The relative survival factor could also be dif- 
ferent using other techniques in specifically remove/degrade/separate het- 
eroduplexes from homoduplexes, as for example gel separation, mismatch 
binding and column separation. 

20 

EXAMPLE 4: Enrichment of sequences in excess over a background se- 
quence population. 

This example describes the possibility to enrich a specific oligonucleotide 
among a diverse background of sequences. 
25 Samples were subjected to the experimental procedure described in Exam- 
ple 2: 

A mixed sample containing 1 pmol sample N12 (4'^12 = 1,7*10'' different spe- 
cies of double stranded DNA) and 0.001 pmol of sample No in 5 )jl of water 
containing 40 mM HEPES pH 7.5, 50 mM NaCI, 16 mM MgCb was subjected 
30 to general procedure 2. Then samples were diluted 300 times and analyzed 
by QPCR (input) and the result from one mismatch selection treatment (out- 
put) as described in Example 2 



69 



QPCR analysis result: 



10 



Sample 


#Ni2 


#No 


Fold excess N12 


Input 


5.11E+08 


1,27E+05 


4040 


Output 


3,80E+03 


6,58E+01 


58 



The enrichment factor with the mismatch selection procedure is 70 (4040/58) 
in this experiment. 

This example demonstrates the possibility to enrich for a specific oligonucleo- 
tide sequence (No) among a diverse pool of oligonucleotide sequences (N12). 
The specific sequence (No) is in 17000-fold excess over each specific N12 
sequences but in 1000-fold (4040 experimentally) lower concentration com- 
pared to the entire population of N12 sequences. Thus, although the back- 
ground sequences are in excess, the specific sequence is able to survive the 
mismatch selection treatment. 



15 FXAMPLE fi: Sequential mismatc h selection treatment 

One important feature with mismatch selection is that one can perform multi: 
pie round of treatment. This will permit extreme enrichment factors which 
might be important when the library size is much larger than the enrichment 
factor obtain in the initial selection. 



20 



25 



A mixed sample containing 1 pmol sample N,2 (4'^12 = 1.7*10^ different spe- 
cies of double stranded DNA) and 0.0001 pmol of sample No in 5 pi of water 
containing 40 mM HEPES pH 7.5, 50 mM NaCI. 16 mM MgClzwas subjected 
to condition described in Example 2. 



Following the procedure described in Example 2. the samples were njn on a 
denaturing 10 % polyacrylamide gel and a gel slice bands containing full- 
length templates (No or N12) [estimated using 32P-labeled marker oligos of 
the same length as No and Nizwere excised from the gel and placed In an 
30 eppendorph tube. 100 pi of 1 x EXT buffer [40 mM HEPES pH 7.5. 50 mM 
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NaCI, 16 mM MgCb] was added. DNA was liberated from the gel by freeze- 
thawing using 2 cycles of heating to 99^C and cooling to -20°C. 400 pi H2O 
was added and 5 pi DNA mixture was used for PGR amplification with 5 pmol 
each of primers F and R in a 25 pi reaction using Ready-To-Go beads (Am- 
5 ersham Biosciences). 

Approximately 1 pmol of amplified material (containing No and N12) was sub- 
jected to another round of mismatch selection as described in Example 2, 
except that the Exo Vll treatment step was excluded. Then the sample was 
10 diluted 300 times and analyzed by QPCR 



QPCR analysis result: 



Sample 




#No 


Fold excess N12 


Input 


1.54E+09 


3,42E+04 


46098 


Output 1^* round of general 
procedure 2 


1,30E+04 


1.99E+02 


66 


Output 2"° round of general 
procedure 2 


3,53E+03 


7,28E+02 


5 



The enrichment factor in the 1®* round of mismatch selection treatment de- 
15 scribed in Example 2 is 45098 / 66 = 683 

The enrichment factor in the 2"** round of mismatch selection treatment de- 
scribed in Example 2 (excluding the Exo Vll treatment) is 66 / 5 = 13 
The total enrichment factor from these two mismatch selection treatment is 
8879(683*13). 

20 This example shows two rounds of mismatch selection treatment but this 
treatment can be continued until desired result is obtained. 
This enrichment factor obtained with the mismatch selection treatment can 
then be multiplied with the initial enrichment factor obtained with the standard 
selection procedure to obtain the overall enrichment factor for the complete 

25 selection process. 
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EXAMPLE 6. General experimental procedure and material for multiple 



codon oligonucleotides 



Double stranded DNA species are formed by extension of oligonucleotide 
5 primer ER on single stranded templates ENo, ENe and EN12 using Sequenase 
2.0 (Amersham Biosciences) and buffer. 

Heat-denaturation and annealing of samples: 

A mixture of double stranded ENo, double stranded ENe and double stranded 
10 EN12 in 5 pi of water containing 40 mM HEPES pH 7.5, 50 mM NaCI, 16 mM 
MgCl2 was heated to 95°C for 10 minutes and allowed to anneal by lowering 
the temperature. 

This treatment is expected to result in the re-formation of ENo-ENo. ENe-ENe, 
15 and ENi2-ENi2homoduplexes and the formation of ENo-ENe, EN0-EN12, ENe- 
ENe. EN6-ENi2and EN12-EN12 heteroduplexes (containing 1 or more mis- 
matched base pairs). 

Treatment of samples with mismatch-specific enzyme: 

20 To the 5 pi mixture was added 1 pi (lOx) Surveyor reaction buffer, 0.5 pi En- 
hancer, 1 pi Surveyor Nuclease, and 2.5 pi H20. The samples were then 
mixed and incubated 1 hour at 42°C. Then the sample were heated to 95*^C 
for 10 minutes and allowed to anneal by lowering the temperature. To 3 pi of 
each sample was added 1 pi 5xEXT buffer [100 mM HEPES pH 7.5, 750 mM 

25 NaCI, 40 mM MgCb], 4 pi H2O, and 1 pi E. coli Exonuclease VII (10 U/pl) 
(USB). The samples were then incubated at 37*=*C for 16 hours. 

Then the samples were run on a denaturing 10 % polyacrylamide gel and a 
gel slice bands containing full-length templates (ENo, ENe and ENi2) [esti- 
30 mated using 32P-labeled marker oligos of the same length as the templates) 
were excised from the gel and placed in an eppendorph tube. 100 pi of 1 x 
EXT buffer [40 mM HEPES pH 7.5, 50 mM NaCI. 16 mM MgCy was added. 
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DNA was liberated from the gel by freeze-thawing using 2 cycles of heating 
to 99°C and cooling to -20°C. 400 Ml H2O was added and 5 pi DNA mixture 
was used for PGR amplification with 5 pmol each of primers EF and ER in a 
25 pi reaction using Ready-To-Go beads (Amersham Biosciences). 

A TOPO-TA (Invitrogen) ligation reaction was assembled with 4 pi PGR 
product. 1 pi salt solution (Invitrogen). and 1 pi vector. Water was added to 6 
pi. The reaction was incubated at RT for 30 min. Heat-shock competent 
TOP1 0 E.coli cells were thawed on ice. 5 pi ligation reaction was added to 
the thawed cells and these were Incubated 30 min on ice, heatshocked in 
42-G water for 30 sec. and then put on ice. 250 pi of growth medium was 
added and the mixture was incubated 1 h at 37oG. The mixture was then 
spread on a growth plate containing 100 pg / ml ampicillin and incubate ON 
at 37oG Individual E.coli clones were picked and transferred to PGR wells 
containing 50 pi water. These 50 pi were incubated at 94oC for 5 minutes and 
used in a 20 pi in a 25 pi PGR reaction with 5 pmol of each TOPO primer 
M13 fonward & M13 reverse and Ready-To-Go PGR beads (Amersham Bio- 
sciences). The following PGR profile was used: 94oG 2 min. then 30 x (94oG 
4 sec 50oG 30 sec. 72oC 1 min) then 72oG 10 min. Primers and nucleotides 
were degraded by adding 1 pM :1 EXO/SAP mixture (USB corp.) to 2 pi PGR 
product and incubating at 370G for 15 min and then 80°G for 15 min to heat- 
inactivate the enzymes. 5 pmol T7 primer was added and water was added 
to 12 pi. Add 8 pi DYEnamic ET cycle sequencing Terminator Mix (Applied 
biosystems). A thermocycling profile of 30 x (95oG 20 sec. 50oG 1 5 sec. 
6O0C 1 min) was run. Then 5 pi water was added to each well and sequenc- 
ing reactions were purified using seq96 spinplates (Amersham Biosciences). 
Reactions were run on a MegaBace capillary electrophoresis instrument (Mo- 
lecular Dynamics) using injection parameters 2 kV. 50 sec and run parame- 
ters- 9 kV 45 min and analyzed using Gontig Express software (Informax). 



Oligonucleotides 



73 



ENo oligo: 

5'GTCGAATGCTGTAGCGGTAGGCAGCA£amCGTCGMieACAGCAAA 
TGAG TCGATGTGCTGAGCTAGAT-3' 

ENe oligo: 

5-. GTCGAATGCTGTAGCGGTAGGCAGCMN!S£iCGTCGlV!At^ 
NATNA GTG 

GATGTGCTGAGGTAGAT -3' 
EN12 oligo: 

GTCGAATGCTGTAGCGGTAGGGAGCNmmCGTCGAN^ 
NGNG TC 

GATGTGCTGAGCTAGAT-3' 
EF Oligo: 

5'-GTCGAATGGTGTAGCGGTAG 
ER oligo: 

5'-ATCTAGCTCAGCACATCGAC 
M1 3 forward: 

5'-GTAAAACGACGGCCAG 
M13 reverse: 

5'-CAGGAAACAGGTATGAC 
T7 primer: 

5'-TAATACGACTCACTATAGGG 
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10 



15 



20 



Libraries of bifunctional complexes will preferably contain more than one 
codon allowing encoding of multiple functional entitles in the displayed mole- 
cule. This example describes an identifier containing three variable regions 
that represent three individual encoding codons. In the homo- hetero-duplex 
separation step in the mismatch cleavage procedure multiple short fragments 
are produced that could potentially recombine through overlapping se- 
quences and generated shuffled variants. These shuffled identifiers would 
then contain codons originated from different original Identifiers. This is 
tested in this example. 

Selected libraries with identifier oligos containing 3 codons was modelled 
using templates ENo. ENe, and EN12. 

It is expected that a library Initially containing 4M 2 = 1 .6e7 different species 
will contain an enriched best binder ENo and a pool of not-best binders ENe. 
The enriched pool of not-best Is expected to be diverse (in this case the ENe 
pool contains 4'^6 = 4096 species but not as diverse as the background of 
non-specific binders (EN12 contains 1,6e7 different species). 

Two samples were mixed to model selected libraries: 



Sample 3.1 A 

1 pmol EN12. 0.1 pmol ENe, 0.1 pmol ENo. This corresponds to a 1 .6e7 
member library that has been selected so that the best binder (ENo) has 
been enriched 1.6e6 fold and the not-best binders (ENe) have been enriched 
25 390 fold (1 ,6e6 / 4096). 



Sample 3.1 B 

1 pmol EN12. 0.01 pmol ENe. 0.01 pmol ENo- This corresponds to a 1.6e7 
member library that has been selected so that the best binder (ENo) has 
30 been enriched 1 .6e5 fold and the not-best binders (ENe) have been enriched 
39 fold (1.6e5/4096). 
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The samples were subjected mismatch selection as described in Example 6. 



Sample 3.1A 





EN12 


ENe 


ENo 


Input (sequencing) 


11 


2 


2 


Output (sequencing) 


5 


0 


6 



5 

This corresponds to an ENo enrichment fold of (6/1 1)/(2/(2+2+1 1)) = 4 



INPUT 1 No : 1 Ne : 10 N12 



» FOl 
» BD2 
» G02 

> DQl 
» B03 
» EOl 
» B02 
» F02 

» coi 

> F03 
» A03 
» H03 
» AQ2 
» DQ3 
» C04 



(1) 
(1) 
(32) 
(32) 
(32) 
(29) 
(32) 
(32) 
(S3) 
(32) 
(32) 
(32) 
(32) 
(32) 
(32) 



GCT GTAGCGCT AJSGCAGC^^^CGTi 
GCTGrAGCGGTAGGCAGCB^^CGT' 
GCTGTA(SCGgrAGGCA.GC|i^^CGT' 
GCTGgAGCCSCTAGTCAGC p^^^ GJ' 

GCTGTAGCCrarAGGCAGqrAIACCGTCGATCCA'CAGCAteGA.Gi 




;a?CGA!rGrGCTGAGCTAG 
TCGAT CT GCT GAGCT AG 
^CAGCA^^^GTCGAT GTGCTGAGCT AG 
ICAGCA^^^GTCGAaPCTGCX GAGCT AG 
TCGAITCTGCTlSAGCrAG 



GCTGTAGCGGTAGG<^GCp^AACGTCGACGACCAGCfl:GCAGTO 

GCTGTAGCGGTASGCAGC|TTTirCCGTCG!ATGOT!cAGC:fl!GC^ 

GCTGrAGCGGSPAGGCy^GC,GGTCP'cGTCGATGTACAGCVWWV\:G>^ 

GCTG!PAGCGGrAGTGCAGpCGTACGirCG!A<aiGfi!GCA.GCAa^ 

GCTGTAGCGCTAGGCAGCCATTGCTTaSAGCCy^CAGC^ 

GCTGTAGCGGyAGGCAGGAATAGCGTcdA.C3GGfi!cAG(^^ 

GCT GXAGCGG!? ASGCAGCGGT ACCQI CgIfUVT CA^CAGCA.b(77G^TCGJU*CT6C!PGA(?CP AG 

GCTGTAGC6(3rAGGCAGCp(^A^CGTCGkAGGA!cAGCAbAFG/lGT^ 

GCX GI AGCGGS AGGCATCpG!? AGK:GTGG^?LAAGl!cASCA.b66GjjGICGA!PGa^GCT G^ 

GCIGgAGCGCTAGGCAG CGCTAlfc CTc dfWW^G BicAtSC^ 



Ir AAISE 



AOS 

BOS (2) 

DOS 

A06(2) 

A06 

COS 

BQ6 

E06 

G06(2) 

COS (2) 

K06 (2) 



(33) 
(33) 
(33) 
(35) 
(33) 
(29) 

(34) 
(34) 
(2B) 
(34) 



OUTPUT 1 No : 1 Ne : 10 N12 



GOT GT AGCGGX AGGCAGC 
GGXGPAGCGGTASGCAGC 
GCT GT AGCGGT AGGCAGc! 
GCSGTAGCGGSASGCAGCI 
GCTCTAGCGGTGGGCi 
GCa*Ga*AGCGG!FAGGCAGCl 




[GTCGATGXGCTGAGCTAG 
CGAXCTGCSGAGCTAG 
iGTCGA*FGtrGCTGAOCTA6 
CGA.TGTGCTGAGCTAG 

CGATCTGCT GAGCT AG 

ICAACA ^ggai^ GTCGATGT GCT GAGCT AG 

GCT GTAGCGGTAGGCAGCGATTXCGTCGSCAGTjCAGCAjC^ 

GCT GTAGCGGTAGGCAGClftGTCTjCGTCGjAIGGqCAGGAGGAGAlorCGATCT 



GCSGT AGCGGT AGGCAGC'AGSi££OG7TGS^ACA^CA£K^ 



10 

Sample 3.18 





EN12 


ENe 


ENo 


Input (theoretical) 


100 


1 


1 


Output (sequencing) 


6 


3 


1 



This corresponds to an ENo enrichment fold of (1/(6+3+1 ))/(1 /1 00) =10 
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and an ENe enrichment fold of (3/(6+3+1 ))/(1/100) = 30 



» F03 
» 603 
» AQ3 
» C04 



INPUT 1 No : 1 Ne : 100 N12 



Ir AAISE 



OUTPUT 1 No : 1 Ne : Ififi N12 



JCGTC 

:gtc 



(31) GCT CT AGCGGTAGGCAC-y^jjj^^ 
<29) GCTGTAGCGGTAGGCAGC^gAC^^ - 
(30) GCTCTAGCGGTAGGCAGC^^^CGTi 
(1) GCTGTAGCGGTAGGCAGCjaGiS^CGTi 
a«-»/=r:iT arrrsoAGCAAT AGCGT 



[gTCGAT CT GCT GAGCa* AG 
IGTCGAT GT GCT GAGCTAG 
SgTCGAT CT GCT GAGCT AG 
SGTCGAT GT GCT GAGCT AG 



These two results shows that enrichment of the expected sequences (No. Ne) 
are identified by sequencing and the composition of the identifier sequences 
are kept intact. There is no shuffling between the codons from the No. Ne. N12 
oligonucleotides. 



10 



15 



irv^r^nte ft • Idf^nfifin^flnn afa 'iq ^nH fmm a lihmrv COWpoSet^ of 61875 dif- 

.^r^nf mol e ^"i'>^ ^^ciated with a unique M^ntifer oligonucleo- 
tiri^ hv selecUnn and subse q "*^nflv using mismatch selection (MISE), 

General arrangement of each complex composed of display molecule and 
Identifier oligonucleotide In the library generation: 



hn-Ra-\ 

HN-5'. 



Oligo Ax 



HN 



OKgo a 



y-Rc-NH2 



Oligo Bx 



Oligo bx 



Oligo Cx 



Oligo cx Oligo d 



5' 



77 



n\/en//ew of thfi lihrarv gf^neration procedure: 



First round of iibrarv gftneration (round A) 



% ^ ^ Building block A x (BBax) 



Pnt-N-RA-^ 



OH 



H2N-6*-template 



Pnl-N-RA-\ 

HN— 5-template 



HN— 5-template 



10 



-Pnr corresponds to pentenoyi - an amine protecting group. "R" can by any 
molecule fragment. The chemical used in library generation comprise a pri- 
mary (shown) or a secondary amine. 



15 



c^ftnond round of library neneration (round B) 



HN-S'-template 




Building block B x (BBbx) 



HN-Ra-^ 

HN-*5'-template 



Rb 



\ 
NH 

Pnt 



o 

HN-Ra-\ 



HN—S'-template 



NH2 



20 



Third round of library generation (round C): 
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Building block C x (BBcX) 



HN— 5'-template 



Rb 
o 



o 

him-Ra"^ 

HN-5'-templale 

HN 

V-Rc-NHa 
O 



n^n^r^i nmcBd"r^ ■ I ihr^n. ae n ^r^finn s.nlection and mismatch subsequent 
selection 

First round nf library gfineration (round A) : 

First oligonucleotides of the A series are each modified by adding to each 
type of oligo a small molecule building block (BBax) to the 5' amine forming 
an amide bond. After this step the template is comprised of oligo Ax. 

Fieinond round oflibra n/ aeneratinn ground B) : 

4 nmol of a mixture of different modified A oligos are then split into a number 
tubes corresponding to the number of different building blocks to be used in 
round B. 190 pmol Oligo a and 2 pi heering DNA is added to each tube and 
the DNA material in each tube Is lyophilized. The lyophilized DNA is then re- 
dissolved in 50 pi water and purified by spining through Biospin P-6 columns 
(Biorad) equilibrated with water. 

Addition of building block: 

The DNA material In each tube is again lyophilized and redissolved In 2 pi 
100 mM Na-borate pH 8.0/100 mM sulfo N-hydroxy succlnimide (sNHS). For 
each tube 10 pi building block BBbx (100 mM in dimethyl sulfoxide [DMSO]) 
is preactlvated by mixing with 10 pi l-Ethyl-3-(3-dlmethylaminopropyl)- 
carbodiimide (EDC) (90 mM In dimethylformamlde [DMF]) and incubating at 
SO^C for 30 min. 3 pi of this preactivated mixture is then mixed with the 2 pi in 
each tube and allowed to react 45 min at 30 °C. Then an additional 3 pi 
freshly preactivated BB is added and the reaction is allowed to proceed for 



79 

45 min at 30 °C. The resulting mixture is then purified by spinning through 
Bio-Rad P6 DG (Desalting gel). 

Addition of codon oligonudeotide : 

The DNA material is then lyophllized and redissolved In 10 pi water contain- 
ing 200 pmol oligo Bx (eg. B1) and the corresponding oligo bx (eg. b1). This 
Is done so that the codon in oligo Bx identifies the BBbX added to the DNA 
template. 10 units of T4 DNA ligase (Promega) and 1.2 pi T4 DNA ligase 
buffer is then added to each tube and the mixture is incubated at 20«'C for 1 
hour. The DNA template linked to the small molecules now comprises an Ax 
oligo with a Bx oligo ligated to its 3' end. The reactions are then pooled, an 
appropiate volume of water is allowed to evaporate and the remaining sam- 
ple is purified by spining through Biospin P-6 columns (Biorad) equilibrated 
with water. 

Removal of building block protecting group: 

The pooled sample (~ 50 pO ^ adjusted to 10 mM Na-acetate (pH 5). 0.25 
volumes of 25 mM Iodine in tetrahydrofuran/water (1:1) is added and the 
sample is incubate at 37 °C for 2h. The reaction is then quenched by addition 
of 2 Ml of 1M NaaSaOa and incubation at room temperature for 5 min. The 
complexes are then purified by spining through Biospin P-6 columns (Biorad) 
equilibrated with water 

To remove suiphonamide protecting groups, the sample is adjusted to 50 pi 
100 mM sodium borate pH 8.5 and 20 pi 500 mM 4-methoxy thiophenol (in 
acetonitrile) is added and the reaction is incubated at 25°C ovemight. Then 
the complexes are purified by spinning through Biospin P-6 columns (Biorad) 
equilibrated with water and then lyophilized. 

ThirH miinri nf libra ry neneration (round C) : 

The samples are dissolved in 175 pi 100 mM Na-borate pH 8.0 and distrib- 
uted into 25 wells (7 pi / well). 2 pi 100 mM BBcX in water/DMSO and 1 pi of 
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250 mM DMT-MM is added to each reaction and incubated at 30 °C over- 
nigth. Water is added to 50 |ji and the reactions are then spin purified using 
Bio-Rad P6 DG (Desalting gel) and subsequently water Is allowed to evapo- 
rate so that the final volume is 1 0 pi. 

5 

Addition of building blocl<: 

The DNA material is then lyophilized and redlssolved in 10 pi water contain- 
ing 200 pmol oligo Cx (eg. CI) and the corresponding oligo cx (eg. c1). This 
is done so that the codon fn oligo Cx corresponds to the BBcX added to the 

10 DNA template- 10 units of T4 DNA ligase (Promega) and 1,2 pi T4 DNA li- 

gase buffer is then added to each tube and incubated at 20^C for 1 hour. The 
DNA template linked to the small molecules now comprises and Ax oligo with 
a Bx ligated to its 3' end and a Cx oligo ligated to the 3' end of the Bx oligo. 
The reactions are then pooled, the pooled sample volume is reduced by 

15 evaporation and the sample is purified by spining through Biospin P-6 col- 
umns (Biorad) equilibrated with water. The pooled sample (-- 50 pi) is ad- 
justed to 10 mM Na-acetate (pH 5). 0.25 volumes of 25 mM Iodine in tetrahy- 
drofuran/water (1:1) is added and the sample is incubate at 37 "^C for 2h. The 
reaction is then quenched by addition of 2 pi of 1M Na2S203 and incubation 

20 at RT for 5 min. Then the DNA templates (carrying small molecules) are pu- 
rified by spinning through Biospin P-6 columns (Biorad) equilibrated with wa- 
ter and then lyophilized. 

Final deprotection step 
25 Some building blocks contain methyl esters that are deprotected to acids by 
dissolving the pooled sample in 5 pi 20 mM NaOH, heating to 80 ^C for 10 
minutes and adding 5 pi of 20 mM HCI. 

Final extension step 
30 To ensure that the DNA templates are double stranded prior to selection 

oligo d is extended along the template by adding to the sample 10 pi of 5 X 
sequenase EX-buffer [100 mM Hepes, pH 7.5, 50 mM MgCb, 750 mM NaCI] 



t iQDTo the IFW Imacie Database on 03/02/2005 



81 

and 4000 pmol oligo d. Annealing is performed by heating to 80«C and cool- 
ing to 20 "C. To the sample is then added 500 dNTP . water to 50 pi and 
39 units of Sequenase version 2.0 (USB). The reaction is incubated at 37«C 
for 1 hour. 

Selection 

This library is subjected to selection, whereby binders to the selection target 
are enriched. 

Maxlsorp ELISA wells (NUNC A/S. Denmark) were coated with each 100 pL 
2pg/mL integrin aVpS in PBS buffer [2.8 mM NaH2P04. 7.2 mM Na2HP04. 
0.15 M NaCI. pH 7.2] overnight at 4-C. Then the integrin solution was substi- 
tuted for 200 pi blocking buffer [JBS. 0.05% Tween 20 (Sigma P-9416). 1% 
bovine serum ainumin (Sigma A-7030). 1 mM MnCId which was left on for 3 
hours at room temperature. Then the wells were washed 10 times with block- 
ing buffer and the encoded library was added to the wells after diluting it 100 
times with blocking buffer. Following 2 hours incubation at room temperature 
the wells were washed 10 times with blocking buffer. After the final wash the 
wells were cleared of wash buffer and subsequently inverted and exposed to 
UV light at 300-350 nm for 30 seconds using a trans-illuminator set at 70% 
power. Then 100 pi blocking buffer without Tween-20 was immediately 
added to each well, the wells were shaken for 30 seconds, and the solutions 
containing eluted templates were removed for PCR amplification and then 
used for mismatch selection (MISE). 

Mifsmatch selecf inn (MISE) : 

The double stranded sample is denatured by heating and allowed to cool 
whereby hetero- and homoduplexes are formed: 

Treatment of samples with nucleases: 

To the 5 Ml mixture ia added 1 pi (lOx) Surveyor reaction buffer (Transge- 
nomic). 0.5 pi Enhancer (Transgenomic). 1 pi Surveyor Nuclease (Transge- 
nomic). and 2.5 pi H2O. The samples are then mixed and incubated 1 hour at 



rrinw ^'(^f fiHAH Kir I ISPTO from the ll=W Image Database on 03/02/2005 



82 

42-C. Then the sample is heated to 95°C for 10 minutes and duplexes are 
allowed to form by lowering the temperature. To 3 ii\ of each sample Is added 
1 Ml 5 X EXT buffer [100 mM HEPES pH 7.5. 750 mM NaCI. 40 mM MgCIzl. 4 
pi H2O. and 1 Ml E. coli Exonuclease VII (10 U/pl) (USB). The samples are 
5 then incubated at 37°C for 1 6 hours. 

Purification by polyacrylamide gel electropttoresis: 

Then the samples are mixed with loading buffer and run on a denaturing 10 
0/0 polyacrylamide gel and gel slices containing full-length templates (esti- 

10 mated using 32P-labeled marker oligos of the same length as the templates) 
are excised from the gel and placed In an eppendorph tube. 100 pi of 1 x 
EXT buffer [40 mM HEPES pH 7.5. 50 mM NaCl. 16 mM MgCb] are added. 
DNA Is liberated from the gel by freeze-thawing using 2 cycles of heating to 
990c and cooling to -20«>C. 400 pl H2O is added and 5 pi DNA mixture is 

15 used for PCR amplification with 5 pmol each of forward and reverse primers 
and in a 25 pi reaction using Ready-To-Go beads (Amersham Biosciences). 

Cloning ofMISE products 

ATOPO-TA (Invltrogen) ligation reaction is assembled with 4 pi PCR prod- 
20 uct. 1 pi salt solution (Invltrogen) and 1 pi vector. Water is added to 6 pi. The 
reaction is then incubated at RT for 30 min. Heat-shock competent TOPI 0 
E.coll cells are then thawed on ice and 5 pi of the ligation reaction is added to 
the thawed cells. The cells are then incubated 30 min on ice. heatshocked in 
42«>C water for 30 sec. and then put on ice again. 250 pi of growth medium is 
25 added to the cells and they are Incubated 1 h at 37°C. The medium containt- 
Ing cells is then spread on a growth plate containing 100 pg / ml amplcillln 
and incubated at 37°C for 16 hours. 

Sequencing ofMISE products: 
30 individual E.coli clones are then picked and transferred to PCR wells contain- 
ing 50 pi water. These 50 pt were incubated at 94«>C for 5 minutes and used 
in a 20 pi in a 25 pi PCR reaction with 5 pmol of each TOPO primer M13 for- 
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ward & M13 reverse and Ready-To-Go PGR beads (Amersham Biosci- 
ences). The following PGR profile is used: 94°C 2 min, then 30 x (94°G 4 sec, 
50°G 30 sec, 72^G 1 min) then 72°G 10 min. Primers and nucleotides are 
then degraded by adding 1 EXO/SAP mixture (USB corp.) to 2 pi PGR 

5 product and incubating at 37^C for 15 min and then 80°G for 15 min to heat- 
inactivate the enzymes. 5 pmoi T7 primer is added and water is added to 12 
IJl. Then 8 pi DYEnamic ET cycle sequencing Terminator Mix (Applied bio- 
systems) is added to each well. A thermocycling profile of 30 x (95^C 20 sec, 
50**C 15 sec, 60**C 1 min) is then run. Then 10 pi water is added to each well 
10 and sequencing reactions are purified using seq96 spinplates (Amersham 

Biosciences). Reactions are then run on a MegaBace capillary electrophore- 
sis instrument (Molecular Dynamics) using injection parameters 2 kV, 50 sec 
and run parameters: 9 kV 45 min and analyzed using Contig Express soft- 
ware (Informax). 

15 
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10 



15 



20 



25 



gyamo/e 8.1 (nf^neral ornnedure 8): G f^neration of» lihrarv containing 61875 
different small moleculfif: and ident ifination of a hinder among tfiese by sub- 
jectino fAie jihrarv to taroet selectio n and subsegiif>nt mismatcfi selection 
(MISE) : 

First round of library genera tion (round A) : 
99 different A oligos were used : 
Oligo Ax general structure : 

5'- NSP-ACCTCAGCTGTGTATCGAGCGGCAGCISHQCGTCG-S' 

1 

The underiined part con-esponds to the 5 nucleotide sequence that varies 
among different A oligos. ie. the codon. The remaining sequence is identical 
among the A oligos. 

N : 5'-Amino-Modifier 5 (Glen research cat# 10-1905-90) 

S : Spacer C3 CPG (Glen research caW 20-2913-01) 

P : PC Spacer Phosphoramidite (Glen research cat# 10-4913-90) 

Oligo a: 5'-TGTGCGACGIIIilGCTGCCGCTCGATACACAGCTGAGGT 

I : inosine 

Onto the free NH2 of the 5' amino modifier on these oligos were loaded the 
listed BBax building blocks: 



Oligo 


Codon 


Building 
block 


Building block structure 


Oligo A1 


TGTTC 


BBa1 
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OligoA2 CGAGC BBa2 | 

V-U 



Oligo A3 T GGATA 



BBa3 



MO— ^ 



Oligo A4 I CGCTG I BBa4 




Oligo A5 I GTTAT | BBa5 




Oligo A6 I AGTGC BBa6 



Oligo A7 I ACCTG I BBa7 




Oligo A8 j CTGGT BBa8 



Oligo A9 TAGGA BBa9 





Oligo A10 


ACTCA 


BBa10 


1 


1 Oligo A11 


CTTAG 


j BBa11 
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OligoA12 I CGCAC I BBa12 



OligoAIS TCGCgI BBaIvJ 



Oligo A14 CGGAT BBa14 



OligoAIS GAGAT j BBa15 



Oligo A1 6 I TGTAG BBa16 



Oligo A1 7 GTGTT 1 BBa17 



Oligo A18 AGATG BBa1» 




Oligo A1 9 ATCCT BBa19 



Oligo A20 1 TTGCT BBa20 









ACGTA BBaZI 
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01igoA22 I ATCAC | BBa22 j ^ — /~ 
Oligo A23 I TATCC I BByv^ia 




Oligo A24 I GGPJKG I BBa24 




■ Oligo A25 I CGGTC I BBa25 




Oligo A26 I TGCTT I BBa26 




OUgoA27 | TTAGC | BBa27 | 
Oligo A28 I GCTGA I BBa28 




Oligo A29 



GAACG I BBa29 




Oligo A30 



CATGG I BBa30 




Oligo A31 



TGGTA 1 BBa31 
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Oligo A32 1 TCAAG 



BBa32 




Oligo A33 I ATCGA 



BBa33 




■"Oligo A34 I ATGCA BBa34 



Oligo A35 I ACTAG 




BBa35 




Oligo A36 I TACCT 



BBa36 




Oligo A37 I TACGA 



BBa37 




Oligo A38 CTTCA 



BBa38 




Oligo A39 



CTCTT 



BBa39 




Oligo A40 



TCATC 



BBa40 



re 




• t 



OligoA41 I ATTCC I BBa41 




Ollgo A42 CGACG BBa42 




OHgoA43 CCTGT I BBa43 




OligoA44 I CCTTC I BBa44 




"Oligo A45 I ACACC j BBa45 




OligoA46 I TAACA 1 BBa46 



re 




Oligo A47 I TAACA | BBa47 




Oligo A48 



CCAGGI BBa48 




Oligo A49 



ATGTC I BBa49 
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OligoASO 1 GAGGA 



OligoA51 I GGTCA 



BBa50 



BBa61 




OligoA52 GACTT I BBa52 



OligoA53 I GGTGGj BBa53 





01igoA54 | CAACT I BBa54 




OligoA55 j ATGAG I BBa55 




Oligo A56 j TCTGC | BBa56 | iHT 
Oligo A57 I ATAGG I BBa57 



Oligo A58 



CTAGG BBa58 
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OligoA59 I AAGTG 1 BBa59 




Ollgo A60 I TCCAA I BBa60 




■ Ollgo A61 GCTCT I BBaBI 




OligoA62 1 


GGAGT 1 


BBa62 I 




OUgoA63 I 


AATCG 1 




— ^ 1 


OligoA64 


AAGCT 1 


BBa64 


y- -OM 1 


Oligo A65 


CCGAA 


BBa65 




Oligo A66 


1 TTTGT 


BBa66 


|-^>^)— 1 


Ollgo A67 


1 CCGTG 


BBa67 


1 "'^T' — ^ 1 

1 
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Oligo A68 | TTTCG 



BBa68 




OHgo A69 1 1 


rOAGG I 


3Ba69 I 
1 1 




Oligo A70 1 


GTTGO 


BBa70 I 




1 Oligo A71 1 


AACTA 1 


BBa71 I 




1 Oligo A72 


AACTA 


BBa72 


i > 1 


1 Oligo A73 


CCTCG 


BBa73 




1 Oligo A74 


1 AGCAA 


BBa74 




1 Oligo A75 


TTCCA 


1 BBa75 




1 Oligo A76 


AGACT 


• 1 BBa76 





94 



OUno A89 1 1 


'AGTC E 


3Ba89 I 
1 ^ 




OliaoA90 1 < 


3GGTG 


BBa90 


1 


OligoA91 1 


GTCAG 


BBa91 1 




1 OligoA92 1 


A/^ A AO 

AoAAo 


RR*QS? i 


1 


1 ongoA93 


GCGAG 


BBa93 




1 origoA94 


1 GATGT 


BBa94 




1 OligoA95 


1 TGACT 


BBa95 




1 OligoA96 


1 CGTCT 


BBa96 




Oligo A97 


AGGTC 


: BBa97 


1 6*^^o- L«- ' 1 
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Oligo A98 


CACTC 


BBa98 




Oligo A99 


CAGTT 


BBa99 


















X 








X 



Second round of library generation f round B) : 



25 different combinations of BBbX and [oligo Bx-oligo bx] pairs were used. 

5 

Onto the free NH2-group of tlie above loaded and deprotected BBaX building 
blocks were loaded the below listed BBbX building blocks: 

Oligo Bx general structure : 

10 

5 - HPO^-CAC AAGTACGAACGTGCATCAGAG- 3' 

The underlined part corresponds to the 20 nucleotide sequence that varies 
among different B oligos, ie. the codon. The remaining sequence is identical 
15 among the B oligos. 

The corresponding b oligos used have the general structure : 
5'-HP03- TCC TCTCTGATGCACGTTCGTACT 

Every b oligo can anneal to a specific B oligo. As can be seen the above 
20 shown B oligo can anneal to the above shown b oligo. 
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I Oligo 1 Codon 



Building block 



Oligo Bl 



AGTACGAACGTGCATCAGAG 



Oligo B2 TAGTCTCCTCCAOl ICCATG 



BBb1 



BBb2 



Building block structure 





Oligo B3 TAGATCGTTCCAGAC IACCG 



BBb3 



Oligo B4 TCCAGTGCAAGACTGAACAG BBb4 




TO 



Oligo B5 AGCATCACTAGTCTGTCTGG | BBb5 



Oligo B6 TCTTGTCAACCTTCCATGCe BBb6 





Oligo B7 AAGGACGTTGCTAGTAGG I ti 133^7 



Oligo B8 GGAACCATCAAGATCCTGACS BBb8 



Oligo B9 



ATCTCTGACGAGATCCAAUG [BBb9 






I « 
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lOligoBlO T 


CAAGGTTGGTGGTGTACIC^ d 


BbIO 




Otigo Bll 1 


rr*r*AAOTTGTTC3CTTCCTCG E 


(Bb11 




1 nliriA &12 < 


DtGAGTGTGTAGTACCAAUti t 


JBb12 




Oligo B13 


ATCTTGGTTGTTCTCt:Hi<Jti 1 


3Bb13 




Oligo B14 


TAGTAGCTTvsoAva i MOAvv-rv-r\:> 


BBb14 




Oligo B15 


TTCACTCCATGCAGCA 1 U l ^ 


BBb15 




Oligo B16 


ACGATGGTGATCGATUAAL^^ 


BBb16 


>- 


Oligo B17 


" TTCAGTGCTTGAGCTACCTG 


" BBb17 




Oligo B18 


r TTGGAGTCTTGTTGCAUUA^^ 


" BBb18 




Oligo BIS 


? TCAACCAACTGGTTt; 1 1 


i BBb19 




Oligo B2 


0 TAGTACTCTACACTCiUiViio*. 


B BBb20 


R3c:ps 
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Oligo B21 


TACACCA T GAO 1 i V3V-»/\v:»mv^vj 


BBb21 




Oligo B22 


RC ATCTTGAGTCGTI GAACG 


BBb22 




Oligo B23 


GAGTCATCTCACTGG AGT I 


BBb23 




Oligo B24 


TCCAGCTTCTAGG AAGACAG 


BBb24 




Oligo B25 


CTTCTTGAGTGCACTAGUAi:^ 


BBb25 


^^^^^ 



Thirri mnnd of lib rary generation (round C) : 
5 25 different C oHgos were used : 
Oligo Cx general structure : 

5'- ViPr>Tr^An AArsTACGAA r.QTGCATCAGAG-3' 



10 



15 



The underlined part corresponds to the 20 nucleotide sequence that varies 
among different C oligos. ie. the codon. The remaining sequence is identical 
among the C oligos. 

Onto these oligos were loaded the listed BBcx building blocks: 
Oligo Cx example : S'-HPOs- 

^,pp^,P^^^^^^^Ar^r>Tr::r^AACCTGGTGCGTTCCTCCACCACGTCTCCG 
Oligo cx example : 5'- r.nAnnAGGTTnr.AGGTnOTGCTCG 
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Oligo 


Codon 


Building 
block 


Building block staicture 


Oligo 
CI 


CGAGCAGGACCTGGAACCTGGTGC 


BBc1 






Oligo 
C2 


CTCGACCACTGCAGGTGGAGCTCC 


BBc2 


O 


Oligo 
C3 


CGTGCTTCCTCTGGTGCACCACCG 


BBc3 




Oligo 
C4 


CCTGGTGTCGAGGTGAGCAGCAGG 


BBc4 






Oligo 

C5 


CTCGACGAGGTCCATCCTGGTCGC 


BBc5 


or" — ^ 


Oligo 
C6 


CGTGAGGAGCAGGTGGTCCTGTCG 


BBc6 




Oligo 
C7 


GCTGACACTGGTCGTGGTCGAGGC 


BBc7 




Oligo 
C8 


GCATCTCGAGGACCTGCTCCTGGG 


BBc8 




Oligo 
C9 


CCACGAGGTCTCCACTGGTCCAGG 


BBc9 




3 


Oligo 
CIO 


GCACTGAGCTGCTCCTCCAGGTGG 


BBc10 


H 


OUgo 
Cll 


GCTCCTGTCCTGCACGTCCATCGG 


BBc11 
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Oligol ^ 
C19 


101 

CGTAGCTCGTGCTGGTCCTCCTGG 


BBc19 1 




j Oligol 
C20 1 


CGACGACCACCACCTTGGACACCC " 


BBc20 I 


/ — \ ^ 1 

1 


1 Oligo 1 
C21 1 


CCTACGTCGTGCTGACGTCCTGCC 


BBc21 1 




I Oligo 1 
I CZ2 


f^rtACGACAGCTAGGAGGAGGTGGG 


BBc22 




I Oligo 
1 C23 


1 CTGGTGGAGCTGGACGAGGACAGC 


BBc23 




1 Oligo 
1 C24 


T CAGGACTGG AGGACGACCAGGTCG 


BBc24 


1 HO 

\ 


1 Oligc 
1 C25 


J CGATGCTGCAGACGACCAGCACCC 


BBc25 


1 HO ^ 

1 ^ 



Pina/ fjytenslon 

Oligo d: 5'-CGGAGACGTGGTGGAGGAAC 



5 
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o^oMitc Of ^.^aupn^inn th^ identif i^^ oiinnnnnteotlHPS that TPsnlt from the tar- 
gat selection 



Fraction 



of templates encoding any binder before selection (estimated):1 in 61 . 



Sequenced identifier oligonucleotides after selection (codons conresponding 
to small molecule X are shown in bold italics): 



» G052.abd (53) 

CGGCAGCCATeGCOTOGC^CAATCrTGGTTGTTCTCCTGCG^CCATCTCGA^VCOACC^^ 

cLca^Lc^Lacatw^tctcctccacttccatgaggaccatctcgac^^^ 

1 

» E032.abd (52) 

CGGGTGCCftAGGTGGTGGTCGTCGTCTCGCAGGAGAACAACCAAGATTGTGCGACGGC^ 

CGGCAG;^CCC^T^a.«TTCACTCCA^GC;.TGCG;.GG^CCTG;^^^ 
_ ijn22 abd (52) 

<^^G^Lc^TcLcATCG^ACTTGTTGCTTCCrCGAGGACCATCTCGAC^^ 

i^^G^Lcl^cLcATCCAGTGCAAGACTG^CAG^GGIVCCTGGTT^^^ 

L^^GCC^^ACCGTcLcATC«.CAACTGGTTCTTGGGAGGAC^^^ 

c;sSGcSLalTcLc3.TAGTCTCCTCa«:TTCC^TG«GGI.C^ 

„ E04 sbd (52) 

CGGCAGCGTTATCGTCGCACATCCAGTGCAAGACTGAACAGIVGGACCACrGAfiCT^ 

Lgcagc^c^c^^catccagcitctaggaagacagaggacc^^^^ 

LS^C^TCGTrTLGC^TCGTCCTCTGCT^TGCACTCAl^ 
_ tT062 Elbd (53) 

CGGCAG;GGAT.CGTCGCACAAGTACGAACC^CAT«GAGAG<^CGaCCC^ 

LG^clGLcrcLc;.TCCAGTGCAAGACXGZU.CAGAGGACA^^ 

LGSG^T^CACrcLcATCGAACTTGTTOCTTCCTCGAAGGAC^^^ 

„ abd (53) 

CGGG^CTGGTCGrCrGCAGa.TCGTCCTCTGCTAGTGCACTa«G^ 

;rclGcirrcccISL««TcxTaA<^c«oc^ 
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> B062.abd <53) 

CGCGACCAGGATGGACCTCGTCGAGTCCTCTGTTCAGTCTTGCACTGGATGCGACGGCACTC 
» F032.abd (53) 

GGAACCTGGAa\GTTGGAGACCTCGTGGTCCTCX5TCTACa\AGTa\.TGGTGTATGTGCGACGACC^ 
5 » F022.abd (52) 

CGGCAGCCGCACCGTCGCACyiGCATCTTGAGTCGTTGAACGAGGACTCGACaiCTGCAGGTGGAGCTCCGTT 
» H052.abGl (53) 

CGGCAGCACACCCGTCGCACAGGAACSiTCAAAGATCCTGAGAGGACCATCTCCSACGACCT 
» G022.abd (S3) 
1 0 CGGCTGACGAGGTCCrCGAACTGGTCCrrCACCTAGTAGGAACGTCCTT^ 
» G042.abd (53) 

CGGCAGCTCTGCCGTCGCACACTTCTTGAGTGCACTAGCAGAGGACCACGAGGTCTCCACTGG 

> E042.abd (55) 

CGCTGCTGCTCACCTCGACACCAGGTCCTCCTTGGATCTCGTCAGAGATTGTGC6ACGGCTCGGCTGCCGCT 
15 » E052.abd (54) 

CGGCAGCCACTCCGTCGCACACTGAGTGTGTAGTACCa!iACGAGGACX3A6Ca^CraAGGAGCACGTGTC^ 
» G012.abd (54) 

CGGCAGCGTTCCGTCGCAO^TTCAGTGCTTGAGCTAACTGAGGACACTCGTNGATGATCCTO 
» H032.abd (53) 
20 CGGCAGCATCCTCGTCGCACATAGTAGCTTGGTACGTATGACCGAGGACCAawa^ 

Fraction of identifiers encoding binder X after selection before MISE treat- 
ment: 1 out of 28. Enrichment fold in selection : (1/27)/{1/61 .875) = 2210 fold 
(theoretical maximum is 61875 fold). 



25 



Results of sequencing the identifier olioonucleotides that result from mis- 
match selection (MISE) (codons corresponding to small molecule X are 
shown in bold italicsV 



30 

» MISES_xl40 (69) GGCAGCATrCCCGGTCGCACACTrCTTC5AGr<?CA— 
CTAGCAG^GGACGATSCrSCAOACQnCCATCCaCCCGTrC 

« MISE5_rll2 (192) GGCAGCATTCCCGTCGCACACSTTCTTGAGSTGCa- 
CTA0CAGPiGGACGATGCTGCAeACGACCAGCACCCGTTC 

CTAGCA0hGGACaATeCTGCATACAGACCAGCACCCGT:i:C 
« MISE5_rll5 (201) 

GGCAGCATTaT€n:CGTCGC^CACTTCTTGAGTGCUiGCTA6CA<^GGIKTCGATGCTGCA7^^ 
« MISE5_rlll (194) 
40 GCT:AGCATTCGCCGTtG<:ACACnP!IX:TTGA0TGCA<^ 
« MISE5_rl48 (232) 

GGCAGCATTCCCGrCGCACACrPTCTTGAGTGCATTAGCAtS^^ 
« MISE5_rl23 (208) 

GGCAGCGTTrCGCGTGCGCACACrTCGrTGAGT<3C»ArcrAffCa<aGGACTGATGCXGCTAGACGACC^ 
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« MlSBS_rX37 (212) 

GGCAGCATTCCCGTCGCACACfTTCTZV3A&r£l^ 
« MISE5_rl44 (22B) 

GGCA6CATTCC3^CGTC6CTACAC!rTCT7QA£?TOC»CrA6!rcaG^ 
5 « MISE5_rl32 (215) 

T66CAGCikTTCCX:GTCGCACACr3«»7C»G7GCAC7A6C:A7(S7^GGATC6^ 
« HZSB5_rl26 (235) 

GGCAGCATG7CCCGTCGCTACGC3TCZ7G»S7GK2A7<:TilG(?C^ 
« MISE5_rl05 (247) 

1 0 GGCAGcarrcccGTCGCACAC'ja' yci 'i ^ qrgCMCTA 

« MISBS__rl31 (298) 



15 Fraction of templates encoding binder X after MISE treatment : 12 out of 13 
Enrichment fold : (1 2/1 3)/( 1/28) = 26 (theoretical maximum is 28 fold). 

The output of the selection process shows 26 different sequences. Thus, it is 
not possible to rank the corresponding display molecule in accordance with 
20 their affinity towards the integrin target.The subsequent Mismatch Selection 
allows the clear conclusion that a single display molecule prevails over all 
others. 

Example 9 

25 A library composed of "\0^7 different identifier oligonucleotides of 165 units 
were assembled according to the general methods of example 8. This library 
was PGR amplified. The amplicons was denaturated at 95 degree Celsius 
and allowed to rehybridisere for 2 hours at room temperature. The rehybrid- 
ised product were run on a standard 4% agarose gel next to a specific identi- 

30 fier oligonucleotide at room temperature. The result of the experiment is 

shown in Fig. 5, in which lane 1 shows the DNA size marker, lane 2 shows 
the 10^7 identifier oligonucleotide library, and lane 3 shows a specific identi- 
fier oligonucleotide. 

35 2A: DNA smear corresponding to heteroduplexes in the IC^T member library 
2B: Band corresponding to homoduplexes in the 10'^7 member library 
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3A: Distinct band corresponding to homoduplex of specific identifier oligonu- 
cleotide. 



Example 10 

5 

The same identifier oligonucleotides as produced in example 9 were used. 

Lane 1: Distinct band corresponding to homoduplex 
Lane 2: ^0^7 identifier oligonucleotide library 
10 Lane 3: DNA size marker 

Lane 4: 10'^7 identifier oligonucleotide library 

Lane 5: IC^T identifier oligonucleotide library, heat-denatured 

1 A: Distinct band corresponding to homoduplex 
15 2A: DNA smear corresponding to heteroduplexes in the ^0^7 member library 
2B: Distinct band corresponding to homoduplexes in the lO'^T member library 
4A: DNA smear corresponding to heteroduplexes in the ^0^7 member library 
4B: Distinct band corresponding to homoduplexes in the lO'^? member library 
5A: DNA smear corresponding to heteroduplexes in the 10'^7 member library 

20 

As can be seen in lanes 4 and 5, the homoduplexes formed in the PGR am- 
plified lO'^? member library (band 4B) are denatured by heat (no band corre- 
sponding to homoduplexes in lane 5) resulting in only heteroduplexes being 
present. A period of renaturation following PGR amplification allows homodu- 
25 plex formation (lanes 2 & 4). 

As can be seen, homoduplexes can be resolved on the gels enabling the iso- 
lation and PGR reamplification of these. Thus this is an iterative method for 
enriching homoduplexes. 

30 
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Claims 

1. A method for obtaining display molecule(s) having affinity towards a 
target, comprising the steps of 

a. providing a library comprising a plurality of different display molecules, 
5 each display molecule being associated with an identifier oligonucleo- 
tide, which codes for the identity of said display molecule, 

b. contacting the library with a target to allow for an interaction between 
the display molecules of the library with the target, 

c. partitioning a fraction enriched in identifier oligonucleotides of display 
10 molecules interacting with the target, 

d. subjecting the fraction to denaturing conditions and subsequently to 
conditions at which homo-duplexes renaturate, 

e. recovering the homo-duplexes, and 

f. deducing from the homo-duplexes the identity of the display mole- 
15 cule(s) interacting with the target. 

2. The method according to claim 1 , wherein in step c the identifier oli- 
gonucleotides of the library members are provided in homo-duplex form. 

3. The method according to any of the claims 1 and 2, wherein in step d. 
the renaturing conditions favours formation of homo-duplexes, while forma- 

20 tion of hetero-duplexes is avoided. 

4. The method according to any of the claims 1 to 3, wherein the renatur- 
ing conditions include that a mixture of hetero-duplexes and homo-duplexes 
is formed. 

5. The method according to any of the claims 1 to 4, wherein the homo- 
25 duplexes in step e are recovered by removal of hetero-duplexes and single 

stranded identifier oligonucleotides. 

6. The method according to claim 5, wherein the hetero-duplexes are re- 
moved by enzymatlcally degradation. 

7. The method according or claim 6, wherein the enzyme is a nuclease. 
30 8. The method according to any of the claims 5 to 7, wherein the enzyme 

is selected from T4 endonuclease VII, T4 endonuclease I, CEL I, nuclease 
S1 , or variants thereof. 
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9. The method according to any of the claims 6 to 8. wherein the enzyme 
is thermostable. 

10 The method of claim 1 . wherein the display molecule is a reaction prod- 
uct of two or more chemical entities and the identifier oligonucleotide com- 
prises codons Identifying the chemical entities. 

11. The method according to claim 10. wherein the chemical entities are 
precursors for a structural unit appearing in the display molecule. 

12. The method according to any of the claims 1 0 to 11 . wherein some or 
all of the chemical entities are not naturally occurring a-amino acids or pre- 
cursoi^ tiiereof. 

13. The method according to claim 10. wherein each codon comprises 4 or 
more nucleotides. 

14. The method according to any of the claims 1 to 13. wherein the display 
molecules of the library are non-a-polypeptldes. 

15. The method according to any of the claims 1 to 14. wherein the display 
molecules of the library are non-nudeic acids. 

16 The method according to any of the claims 1 to 15. wherein the display 
molecule has a molecular weight less than 2000 Dalton, preferably less than 
1000. and most prefen-ed less than 500 Dalton. 

17. The method according to any of the preceding claims, wherein the iden- 
tifier oligonucleotide uniquely identifies the display molecule. 

18. The method according to any of the claims 1 to 17. wherein one or 
more chemical entities are transferred to the nascent display molecule by a 
chemical building block further comprising an anti-codon. 

19. The method according to claim 18, wherein the information of the anti- 
codon is transferred in conjunction with the chemical entity to the nascent 
complex. 

20. The method according to any of the preceding claims, wherein the 
chemical entities are reacted without enzymatic interaction. 

21 . The method according to any of the claims 1 to 20. wherein the codons 
are separated by a framing sequence. 
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22. The method according to any of the claims 1 to 21 , wherein a selec- 
tively cleavabfe linker joins the display molecule and the identifier oligonu- 
cleotide. 

23. The method according to claim 22, wherein the linker is cleaved by irra- 
5 diation. 

24- The method according to any of the claims, wherein the library com- 
prises one, two or more different library members. 

26. The method according to any of the claims 1 to 24, wherein the library 
comprises 1 ,000 or more different members. 

10 26. The method according to claim 1 , wherein the molecular target is of a 
biological origin. 

27. The method according to any of the claims 1 to 26, wherein the molecu- 
lar target is Immobilized on a solid support. 

28. The method according to claim 27, wherein the target immobilized on 
15 the support forms a stable or quasi-stable dispersion. 

29. The method according to claims 27 or 28, wherein a cleavable linker is 
present between the solid support and the molecular target 

30. The method according to any of the claims 1 to 29, wherein the molecu- 
lar target is a protein. 

20 31. The method according to claim 30, wherein the protein is selected from 
the group consisting of kinases, proteases, phosphatases, and anti-bodies. 

32. The method according to any of the claims 1 to 30, wherein the molecu- 
lar target and/or the display molecule is a nucleic acid. 

33. The method according to claim 32, wherein the nucleic acid is a DNA or 
25 RNA aptamer. 

34. The method according to any of the claims 30 to 33, wherein the target 
protein is attached to the nucleic acid responsible for the formation thereof. 

35. The method according to any of the claims 1 to 34, wherein the contact- 
ing step includes that a target is mixed with a library of complexes. 

30 36. The method according to claim 35, wherein a target is saturated with a 
known ligand prior to the mixing step. 
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37. The method according to claim 1 , wherein the recovered homo- 
duplexes of step e are amplified prior to decoding the identity of the display 
molecule. 

38. The method according to any of the claims 1 to 37, wherein the parti- 

5 tioned fraction of Identifier oligonucleotides of step c Is amplified by PCR prior 
to step d. 

39. The method of claim 38, wherein the identifier oligonucleotides are pro- 
portionally amplified. 

40. The method according to any of the claims 1 to 39, wherein the recov- 
10 ered homo-duplexes of step e one or more times are recycled to step d. 

41. The method according to claim 40, wherein the recovered homo- 
duplexes are amplified prior to the treatment according to step d. 

42. The method according to claims 39 or 40, wherein a decoding occurs 
before recycling to step d. 

15 43. The method according to any of the claims 40 to 42, wherein the infor- 
mation obtained from the decoding Is used to modify the composition of the 
identifier oligonucleotides or complements thereof before recycling to step d. 
44. The method according to claim 42, wherein the modification includes 
extinction of certain identifiers. 
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Abstract 

The present invention relates to a method for selecting a molecule from a 
library of such molecules associated with identifier oligonucleotides, said 
molecule having affinity towards a target. The method involves contacting the 
5 library with a target to allow for an interaction between the molecules and the 
target and partitioning a fraction enriched in identifier oligonucleotides of 
molecules interacting with the target. After an optional nucleic acid amplifica- 
tion of the partitioned fraction, the fraction Is subjected to denaturing condi- 
tions and subsequently to renaturing conditions at which homo-duplexes are 
10 formed. The homo-duplexes are subsequently recovered and decoded to 
identify the identity of the molecule Interacting with the target. 
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